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Preface 


W HEN the writer prepared Statistical Me^d in Education^ 
he had in mind the needs of those in^tutions that offer 
more than one semester’s work in the subject and of those 
individuals who wish to prolong their study beyond a first 
course. Experience has revealed, however, that a more limited 
treatment finds favor with many persons, since they do not 
desire to pursue the subject beyond what can be covered in a 
one-semester course and therefore do not care for a larger and 
more comprehensive book than is necessary for that purpose. 
He has written this volxune for such persons, including in it 
only as much as his students cover in one semester. He has 
added a few suggestions for further study, so that those who 
wish to carry on will know where to turn for assistance. 

Much of the content is relatively well known ; new formulas, 
procedures, and other matters appearing appropriate in an 
elementary text have been added. The presentation avoids 
higher mathematics, presupposing no preparation therein 
beyond arithmetic and a small amount of high-school algebra 
and geometry. The chief abilities needed to master the con- 
tent are the power to think clearly, the accurate and reasonably 
rapid performance of the fundamental operations with integral 
and decimal numbers, and the habit of careful, orderly work. 

The content is arranged in the order which the writer prefers, 
but instructors may wish to make changes. For example. 
Chapters XIII and XIV contain material part or all of \yhich 
may be introduced earlier with little difficulty. For those 
who desire to cover even less material than the volume presents, 
the writer suggests Chapters Xl and XII and the topics on 
the geometric mean, the harmonic mean, estimated true scores, 
determining difficulties of test elements, and equally noticed 
differences as portions that may be omitted. 


C. W. Odell 
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CHAPTER I 


Introduction 


Purpose and plan of this treatise 

In preparing this volume the author has had in mind that 
there are three chief groups of persons engaged in educational 
activities for whom acquaintance with statistical methods is 
essential. One is composed of research workers, that is, pro- 
ducers. A very necessary step in the development of any 
science is the application of methods of measurement and com- 
putational procedures to experimental and other research work 
therein. Recognition of this fact, with consequent develop- 
ment and application of statistical methods, has come rather 
recently in education and the other social sciences, but it is now 
widespread. No one can be considered competent to conduct 
many types of research unless he has command of statistical 
principles and procedures. Their use renders it possible to give 
brief, exact descriptions and convenient, meaningful summaries; 
tends to make thinking definite; provides bases for drawing 
conclusions and for making estimates; and enables workers to 
analyze and simplify complex data. 

A second and much larger group is that of teachers, adminis- 
trators, and others who devote their efforts chiefly to the day- 
by-day operation of the educational system. These consumers 
need both the ability to select and employ those statistical 
methods which contribute to the efficient performance of their 
duties and also sufficient understanding of statistical terminol- 
ogy and procedures so that they can mentally digest and then 
apply the results of studies carried on by others. Modern 
educational literature is replete with statistical terms. Such 
terms, as well as the applications of statistical procedures, occur 

1 
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in almost all aspects of educational activity. They are found 
in studies and discussions of the curriculum and the extracurric- 
ulum, of buildings and equipment, of pupils and teachers, of 
textbooks and other supplies, of finances, and so on almost 
without end. They are frequent in school surveys, in guidance 
and counseling, in diagnosis and prognosis, in reports on 
absence and tardiness, in studies of progress and elimination, 
and so forth. Indeed, no important phase of our school system 
operates without some application of statistics. 

The third group is composed of students, chiefly graduate 
but also undergraduate. Their needs are similar to those of 
the second and also, for many individuals, include those of the 
first group. It is for this last category, including both present 
and future consmners and producers, that this volume is chiefly 
intended. It is, therefore, its purpose to present as many 
fundamental procedures, principles, and terms as the typical 
class of advanced undergraduate or graduate students withoxit 
especial training in mathematics can cover in a one-semester 
course. The treatment has been made as nonmathematical as 
is consistent with the objectives and content, emphasizing prac- 
tical computation, use, and understanding, rather than deriva- 
tions and theoretical considerations. The mathematical ability 
most needed is that to perform the four fundamental operations 
accmately and fairly rapidly with integral and decimal num- 
bers. Also, a few elements from high-school algebra and plane 
geometry are necessary, but no mathematics beyond that. 

The author hopes that all those who study this treatise will 
gain therefrom some enhancement of each of the several abilities 
listed below: 

To select appropriate procedures and measures; 

To compute measures; 

To interpret measures reported by others ; 

To associate terms and concepts; 

To present results. 

General suggestions concerning statistical procedure 

Statistical method is not a substitute for thought, but an 
aid — often an essential aid— to thought. It is a means, not an. 
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end. It should be employed only to the extent needed for the 
solution of the problem at hand or for the conclusion of the 
desired research. Graduate students and other beginners in 
educational research are frequently prone to employ unneces- 
sarily elaborate and complicated statistical procedures in the 
belief that so doing increases the value of their results. Instead, 
such statistical measures and computations as most directly 
and simply reveal the significant facts should be used. 

One violation of the principle just stated is the attempt to 
correct the inadequacy of data by overrefined statistical pro- 
cedures. In general, measures derived from data are no more 
reliable and vahd than the original data. Certain exceptions 
to this will be noted in appropriate connections later. They 
are chiefly concerned with types of errors the effects of which 
tend to balance one another or to be negligible when some 
measures are computed. Moreover, increasing the number of 
cases may or may not serve to eliminate errors, depending upon 
what type of errors is involved. 

In general, the statistical work involved in a study should be 
planned before it is begun. The plan should provide for col- 
lection of data, preliminary analysis and tabulation thereof, 
classification and sulisequent tabulation, computation, inter- 
pretation, and presentation. Minor modifications will often 
prove desirable; but, if a worker finds that he is constantly 
making major additions or alterations to his originally planned 
statistical procedures, he may be sure that his initial planning 
has been too brief or otherwise unsatisfactory. He should 
consider carefully such matters as the kind of data to be used; 
how they are or may be interrelated; how they bear upon the 
question or questions at issue; how accurate, adequate, repre- 
sentative, and otherwise valid they are; how they are to be 
analyzed; what measures can and should be computed from 
them; how they can be so collected, expressed, and tabulated 
as to be most conveniently employed ; to whom the conclusions 
are to be presented and therefore in what tabular, graphic, and 
other forms they should be arranged; and other similar points. 

No one can become a competent statistical worker who does 
not possess the capacity, both intellectual and temperamental. 
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for careful, thorough, unbiased, accurate work. This includes 
both the computational and the reasoning aspects of the work. 
An individual competent in only one of these two phases cannot 
become an efl&cient statistician. His best recourse is to col- 
laborate with someone competent in the other phase — an 
arrangement that may be satisfactory. Moreover a worker 
cannot employ statistical method constructively imless he has 
a wide knowledge of the field and can think imaginatively. 
Insofar as the degree of computational ability necessary to 
accurate xise of the formulas and methods given in this book is 
concerned, almost every student at all likely to be studying 
educational statistics can develop it by sufiicient practice. 


Compufation 

Even though the ability to compute accurately and with fair 
rapidity is essential for efficient statistical work, anyone who 
expects to do much in this field should expedite his labors by 
employing computational aids. These fall into two general 
classes: tables and mechanical devices. Numerous types of 
both are available. 

The one most useful set of tables for workers in educational 
statistics is probably that known as Barlow’s.^ This gives the 
squares, cubes, corresponding roots, and reciprocals of all 
integers up to 12,500. By interpolation, both nonintegral and 
larger numbers may be similarly dealt with. Less extensive 
tables may be almost as useful in elementary work. Multipli- 
cation tables extending up to such limits as 1,000 X 1,000 or 
100 X 10,000 are available, but have not received much use 
among American statisticians. In some computations, not 
many of which are among those included in this book, logarith- 
mic tables are helpful ; in a few they are indispensable. 

In addition to tables such as have been mentioned, which are 
widely useful in numerous types of work, there are others of 
more limited scope insofar as their area of usefulness is con- 

^ Barlow, Peter, Tables of Squares^ Cvhes^ Square Roots, Cube Roots, and Recipro- 
cals of All Integer Numbers Up to 12.600. Fourth Edition. London: E. and 
F. N. Spon, 1941. 258 pp. 
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cerned. Pearson’s® sets of tables include a number concerned 
with functions of the normal curve and also others frequently 
employed. One of them — Sheppard’s Table of the Probability 
Integral — is the commonly used source table for many pro- 
cedures and computations. 

Still more limited are groups of tables prepared specifically 
for the use of educational, or educational and psychological, 
workers. These groups tabulate values for formulas, opera- 
tions, and functions frequently needed by such persons. Many 
statistical texts contain one or more tables of this sort. Three 
sets will be mentioned here. Dunlap and Kurtz® are the 
authors of a handbook of three parts that deserves high rank. 
The first part presents almost thirty nomographs, or graphic 
computational aids, useful for securing values often desired in 
educational work. The second contains a dozen tables, some 
of general computational nature and others more or less pecu- 
liar to education. The third part lists more than four hundred 
formulas and two hundred symbols used in this field. Hol- 
zinger^ has compiled a set of a* dozen tables, similar to those in 
the second part of Dunlap and Kurtz’s book, that represent a 
well-chosen combination of the two types. Kelley’s® set has 
only five tables; but, since the most important of them — 
Number I — contains values of several functions, they cover 
about as wide a field of usefulness as those of Holzinger. 

The chief mechanical aids to computation are slide rules and 
computing machines. For those familiar with their use, slide 
rules, such as engineers commonly employ, are quite helpful in 
multiplication, division, and some other operations. A ten- 
inch rule gives approximate accuracy to three figures; a 
twenty-inch rule, to four or five figures. Dunlap and Kurtz 
have devised an inexpensive slide rule which may be used as 

* Pearson, Karl, Editor, Tables for Statisticians and Biometricians. Cambridge: 

Cambridge University Press. Issued by Biometric Laboratory, University Col- 
lege, London. Part I, Second Edition, 1924. 143 pp. Part II, 1931. 262 pp. 

* Dunlap, Jack W., and Kurtz, Albert K., Handbook of Statistical Nomographs, 

Tables, and Formulas. New York: World Book Company, 1932. 163 pp. 

^ Holzinger, Karl J., Statistical Tables for SHidents in Education and Psychology. 
Fifth Impression. Chicago: University of Chicago Press, 1938. 104 pp. 

® Kelley, Truman Lee, The Kelley Statistical Tables. New York: The Mac- 
millan Company, 1938. 136 pp. 
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suggested above and also for finding values of a number of 
functions often needed in educational and psychological work.® 
Enlow has constructed a more elaborate and durable — also 
more expensive — rule of the same general type J 

Some computing machines are hand-operated; others are 
electrically driven. Those of the latter type are superior but 
much more expensive. The machines vary in scope of useful- 
ness from simple adding machines to those with such features 
as automatic multiplication and division. Among the leading 
varieties are the Comptometer, the Burroughs, the Dalton, the 
Friden, the Marchant, and the Monroe. 

Still more complicated than these, and more useful for certain 
operations, are the card-punching and automatic sorting 
machines, such as the Hollerith and that of Warren and 
Mendenhall. For a project involving multiple classification of 
many cases such a machine is highly desirable. It will not only 
sort the data, but also will yield results that greatly facilitate 
computing coefficients of correlation and other frequently 
wanted measures. 

A still different variety of machine is that for scoring tests. 
Several of this sort have been devised, but the only one receiving 
very much use is that of the International Business Machines 
Corporation. This electric machine will score special answer 
sheets designed for use with numerous standard tests with high 
accuracy and at a rate of several hundred per hour. 

Accuracy 

The accuracy of results obtained by statistical procedures 
depends upon the accuracy of both the original data and the 
computations to which they are subjected. Before actually 
performing the statistical operations involved in any project, 
the computer should decide how accurate he wishes the results 
to be and then take such precautions in his work as will, if pos- 
sible, insure that degree of accuracy but at the same time avoid 
such unnecessary labor as carrying more significant figures or 
decimal places than are essential. 

® This rule may be obtained from the Psychological Corporation. 

^ Enlow^s slide rule may be secured from Keuffel and Esser. 
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In the cases of some types of data, common usage has more 
or less agreed upon the degree of accuracy to which they are 
determined and reported, but in most instances this decision 
rests with the person making the study. The person or persons 
for whom the conclusions are intended is often an important 
factor. Data designed for the general public or other large lay 
groups should usually be “rounded off,” so as to make them 
easy to comprehend and remember; whereas those for small, 
highly interested groups of laymen or for professional workers 
frequently may well be more nearly exact. Too great detail 
may confuse rather than inform the lay or relatively uninter- 
ested hearer or reader. This is particularly true of numerical 
oral statements. If an audience is told that the average salary 
of teachers in one system is about $1,250, for example, whereas 
that in a group of comparable systems is about $1,400, they 
will probably get and retain a clearer impression than if they 
are given such figures as $1,245.18 and $1,406.66, respectively. 
Similarly, if they are told that enrollment is increasing about 
5 per cent per year, they will be likely to receive and remember 
the point better than if they are told that the annual increase 
is 4.86 per cent. 

There are numerous rules as to the steps necessary to insure 
desired degrees of accuracy in results and, conversely, as to how 
great accuracy results from specified procedures. At the risk 
of overemphasizing the point, a few of them will be given here. 
They may be stated in terms of either absolute or relative 
accuracy. Sometimes one is more convenient, sometimes the 
other. Absolute accuracy is expressed in terms of the actual 
amount; relative accuracy, in terms of what fraction that is 
of the number itself. Several examples of absolute accuracy 
expressed decimally will be given here and some of relative 
accuracy later. Thus 4.6 is conventionally understood to be 
accurate to one decimal place or to the nearest tenth, that is, 
to represent a value that, if more accurately given, would be 
nearer to 4.6 than to either 4.5 or 4.7. Another way of stating 
the same thing is that 4.6 represents a true value between 4.55 
and 4.65. Therefore its limit of error is .05. Similarly 23.08 
is accurate to two decimal places, or to the nearest hundredth. 
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or represents a true value between 23.075 and 23.085 ; 573. is 
accurate to the nearest unit, or represents a true value between 

572.5 and 573.5; and so on. 

If a whole number ends in one or more zeros and is accurate 
to the nearest unit, it should be followed by a decimal point; 
if not, it should be understood as accurate only to the last digit, 
not zero. Thus 2,600. should be understood as accurate to the 
nearest unit, that is, as representing a true value between 

2.599.5 and 2,600.5; whereas 2,600 should be considered 
accurate only to the nearest hundred, or as standing for some 
value between 2,550 and 2,650. Sometimes the notation 
26. X 10® is used to mean the same thing. When such a num- 
ber ends in two or more zeros of which some are accurate and 
some not, the latter method may be employed, or a dot or bar 
may be written over the last significant one. Thus 260. X 10, 
or 2,600 or 2,600, denotes accuracy only to the nearest ten, 
that is, a true value between 2,595 and 2,605. Similarly 
520. X 10®, or 520,000 or 520,000, denotes accuracy to the 
nearest thousand, or a true value between 519,500 and 520,500. 

Absolute accuracy may also be expressed in terms of signifi- 
cant digits or figures. A number has as many significant figures 
as it contains digits that are considered correct. Zeros used 
merely to indicate the position of the decimal point are not 
counted. Thus 234. and 23.4 and .00234 each has three 
significant figures; 25,000 and 6.8 and .0090 each has two; 
47,914 and 12,580,000 and .074208 each has five; and so on 
with others. 

The relative accuracy of a number is the limit of accuracy 
or error divided by the number itself. Thus, since 4.6 repre- 
sents a true value between 4.55 and 4.65, its error cannot be 
greater than .05 and its relative accuracy is .05 -f- 4.6, which 
gives .0109 or 1.09 per cent. Since 23.08 denotes a value be- 
tween 23.075 and 23.085, its limit of error is .005 and its relative 
accuracy is .005 23.08, which equals .000217 or .0217 per 

cent. 

The error in a sum or total is equal to the algebraic sum 
of the errors in the quantities added to produce it. In other 
words, it equals the difference between the sum of the posi- 
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tive errors and that of the negative errors. If all the errors 
are in the same direction,* their algebraic sum is the same 
as their arithmetical sum; hence the error in the total equals 
the latter. If, as often, the errors in the addends are as likely 

to be positive as negative,® their total tends to ap- 41 875 

proximate zero; so the sum is likely to be rather 13.2 

accurate. In general, a total is more accurate than 23 ^3 

most of the quantities that contribute to it. Its last 9.5 
certainly accurate figure, however, is no farther to the 
right than the last one in the least accurate of the 
measures added. For example, the sum of the numbers at the 
right cannot be depended upon beyond 132.6 and should be 
given in that form, not as 132.608. The same is true of a 
remainder or difference. 


Since an average is a sum divided by the number of items 
added, its absolute error is the error of the sum divided by the 
number of cases. Hence, if the errors in the items are variable, 
the error in the average is likely to approach zero. This fact 
justifies carrying an average t6 more decimal places than are 
given in the items from which it is secured. If fewer than 
10 items are added, their average should not be carried to more 
decimal places; if from 10 to 99, it may be carried to one more; 
if from 100 to 999, to two more; if from 1,000 to 9,999, to three 
more; and so on. In other words, the additional decimal 
places to which an average may be carried are one less than the 
number of digits in the number of cases. Thus, since there are 
three digits in 450, an average of 450 cases may be carried to 
two more places than the single items; since there are two 
digits in 38, an average of so many may be carried to one more; 
and similarly for others. Another rule that is practically the 
same in effect is that generally as many figures are accurate in 
an average as in the sum from which it was computed. 

The most commonly given rule for multiplication and divi- 
sion is that the number of significant figures in a product or 


* Such errors are called systematic, constant, group, biased, cumulative, or rum- 
compensating. 

• Errors of this sort are known as variable, chance, accidental, individual, random, 
unbiased, noncumulative, or compensating. 
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quotient is generally the same as in the one of the two con- 
tributing numbers that has the fewest. Frequently the last 
may be in error by 1, however, and occasionally by more. 
For example, the product of a multiplicand with four significant 
figures and a multiplier with two generally has two accurate; 
the quotient of a dividend with three and a divisor with five 
ordinarily has three; and so on. As an illustration, 24.71 has 
four significant figures accurate and 1.8 has two; hence their 
apparent product, 44.478, is probably accurate only to two 
places — 44. This can be tested- by multiplying together the 
minimum and also the maximum values that the numbers may 
represent. The former, 24.705 and 1.75, give 43.23375, and the 
latter, 24.715 and 1.85, give 45.72275. Both differ from 44. in 
the last figure. As another example, 75.94 -r- 800. = .094925, 
which is probably acemate as far as .0949. To test this, as in 
any case of division, the minimum value of the dividend should 
be divided by the maximum value of the divisor, and vice versa. 
This gives 75.935 800.5 = .094859 and 75.945 799.5 = 

.094991. The first of these reduced to four places agrees with 
.0949, whereas the latter becomes .0950. Since a power of a 
number is obtained by multiplication, this same general rule 
applies to it also. 

The accuracy of products and quotients may be dealt with 
relatively as well as absolutely. The limit of error in a product 
or quotient is the approximate algebraic sum of the limits of 
errors in the numbers involved, all errors being relative. For 
example, 1.5 X 40. X .0648 = 3.888. The limits of relative 

.05 .5 

errors of the three numbers are = .0333, = .0125, and 

^06^ ~ .0008, respectively. Their sum is .0466; hence this 

is the approximate limit of the relative error of the product, 
3.888. If it is desired, the approximate limit of its absolute 
error can be fmmd by multiplying .0466 by 3.888, which gives 
.1812. 

Powers and roots are special types of multiplication and divi- 
sion, and so may be treated as suggested above. Simpler 
procedures are available, however. A power usually has as 
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many figures accurate as its base and a root as many as the 
number from which it was extracted. The relative error of the 
nth power of any number is approximately n times that of the 
number itself, and the relative error of the nth root of any 

munber is approximately - that of the number. For example, 

TV 

if a number has a limit of error of 1 per cent, its square has a 
limit of error of approximately 2 per cent and its square root of 
approximately .5 per cent; if it has a limit of error of .3 per cent, 
its cube has an approximate limit of error of .9 per cent and 
its cube root of .1 per cent. 

With logarithms, there is no gain in accuracy in using man- 
tissas containing more than one more decimal place than the 
number of significant figures accurate in the number with the 
fewest. Thus, if the number with the least has three significant 
figures accurate, four-place logarithms are sufficient. 

In a computation involving several steps, it is usually best 
to retain more figures or decimal places than arc desired accu- 
rate in the final result, but application of the rules suggested 
and experience should guide workers in not burdening them- 
selves by keeping more than are needed. When digits are 
dropped, the last preceding digit should be increased by one if 
the first of those dropped is 5 or greater. For example, if .4174 
is reduced to two decimal places, it becomes .42; if .03846 is 
reduced to three, it becomes .038; if 24,252 is reduced to three 
significant figures, it becomes 24,300 or 243 X 10*; and so on. 
In determining per cents and other quotients, the process of 
division should always be carried far enough to indicate 
whether the last significant figure or decimal place to be kept 
is correct as first found or should be increased by 1. Thus, if 
one wishes 10 60 to the nearest hundredth, he should not 

stop with .16, the first two figures, but should go on to .166, 
which indicates that the result should be written as .17, not .16. 

We should not forget that numbers are often exact, in other 
words, accurate to an infinite number of places. For example, 
if one room has 41 pupils, another 38, and another 34, these 
numbers are exact, not approximate. Their sum is exactly 1 13 ; 
hence their average may be carried to as many places as desired. 
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A multiplier or divisor is frequently perfectly accurate, in 
which case acciaracy of the multiplicand or dividend determines 
that of the product or quotient. 

References for further study 

Although this volume deals with statistics not requiring 
much mathematical ability above the elementary-school level 
and none above that of the high-school level, yet some other- 
wise competent individuals have difficulty with that. Those 
who know or fear that they will have such a handicap are 
advised to secure and study Walker ’s“ excellent book, which 
contains a series of short chapters on the mathematics essential 
to satisfactory work in elementary statistics. Each chapter 
begins with a self-test which students are to take. A high 
score thereon indicates that a student does not need to study 
the chapter in question; a low score, that he does. In the 
latter case, after study, he takes a test found at the end of the 
chapter to determine if he has mastered it sufficiently well and, 
if not, studies it further. The topics with which the chapters 
deal are such as significant figures, short cuts in computation, 
common fractions, multiplication of polynomials, and log- 
arithms. 

Kurtz and Edgerton’s^^ dictionary of terms employed in the 
general field of statistics — ^mathematical, business, educational, 
psychological, and so forth — should be useful to many who 
use this voliune. It contains definitions and explanations of 
over two thousand terms, symbols, and abbreviations. Another 
source in which may be found a number of definitions and 
critical discussions of the value and use of statistical procedures 
is ihe Encyclopedia of Educational Research, edited by Monroe.“ 
Still another is the Dictionary of Education, compiled under the 
editorial leadership of Good.^® 

Walker, Helen M., Maihematics Essential for Elementary Statistics, New York: 
Henry Holt & Company, Inc., 1934. 246 pp. 

Kurtz, Albert K., and Edgerton, Harold A., Statistical Dictionary of Terms and 
Symbols, New York: John Wiley & Sons, Inc., 1939. 191 pp. 

“ Monroe, Walter S., Editor, Encyclopedia of Educational Research, New York: 
The Macmillan Company, 1941. 1344 pp. 

“ Good, Carter V., Editor, Dictionary of Education. New York: McGraw-Hill 
Book Company, Inc., 1945. 495 pp. 
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There are niunerous other volumes that those who wish to 
consult other books covering more or less the same range of 
content as this may use. Of these, the following are among 
those which the present writer regards as most helpful: 

Broom, M. E., Educaiiondl Statistics for Beginning Students. New 
York: American Book CJompany, 1936. 318 pp. 

Cooke, Dennis H., Minimum Essentials of Statistics, New York: 

The Macmillan Company, 1938. 271 pp. 

Garrett, Henry E., Statistics in Psychology and Education. Second 
Edition. New York: Longmans, Green & Company, Inc., 1937. 
491 pp. (This also contains more advanced material.) 

Gray, Clarence T., and Votaw, David F., Statistics Applied to Educa- 
tion and Psychology. New York: The Ronald Press Company, 
1939. 278 pp. 

Lindquist, E. F., A First Course in Statistics. Revised Edition. 

Boston: Houghton Mifflin Conpany, 1942. 242 pp. 

Tiegs, Ernest W., and Crawford, Claude C., Statistics for Teachers. 

Boston: Houghton Mifflin Company, 1930. 212 pp. 

Walker, Helen M., Elementary Statistical Mdhods. New York: 
Henry Holt & Company, Inc., 1943. 368 pp. 

Those who wish to piu^ue the study of statistics beyond the 
elementary stage which this volume presents likewise have the 
possibility of choice among many books, monographs, articles, 
and other printed sources, in both educational statistics and 
general mathematical statistics. Among volumes of the 
former type which present material distinctly in advance of 
that in this book are: 

Guilford, J. P., Fundamental Statistics in Psychology and Education. 

New York: McGraw-Hill Book Company, Inc., 1942. 327 pp. 
Holzinger, Karl J., Statistical Methods for Students in Education. 

Boston: Ginn & Company, 1928. 372 pp. 

Lindquist, E. F., Statistical Analysis in Educational Research. Bos- 
ton: Houghton Mifflin Company, 1940. 266 pp. 

Monroe, Walter S., and Engelhart, Max D., The Scientific Study of 
Educational PrMems. New York: The Macmillan Company, 
1926. 504 pp. 

The following are more general or mathematical treatments 
of statistics. The book by Mode and Part I of Kenney’s book 
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are relatively elementary; the others include much advanced 
material: 

Ezekiel, Mordecai, Methods of Correlation Analysis, Second Edition. 

New York: John Wiley & Sons, Inc., 1941. 531 pp. 

Fisher, R. A., Statistical Methods for Research Workers, Eighth Edi- 
tion. Edinburgh: Oliver and Boyd, 1941. 344 pp. 

Kelley, Truman L., Statistical Method, New York: The Macmillan 
Company, 1928. 390 pp. 

Kenney, John F., Mathematics of Statistics, New York: D. Van 
Nostrand Company, Inc., 1939. Part I, 248 pp.; Part II, 202 pp. 
Mode, Elmer B., The Elements of Statistics, New York: Prentice- 
Hall, Inc., 1941. 378 pp. 

Peters, Charles C., and Van Voorhis, Walter R., Statistical Procedures 
and Their Mathematical Bases, New York: McGraw-Hill Book 
Company, Inc., 1940. 516 pp. 

Wilks, S. S., Mathematical Statistics, Princeton, New Jersey: 
Princeton University Press, 1943. 284 pp. 

A limited number of specific references will be given at the 
end of each chapter. They are chosen to serve both those who 
wish to read other discussions of the same scope Jis those in the 
text and also those who desire more advanced treatments of 
the same topics. 


EXERCISES AND PROBLEMS 

1. Between what limits does the true value of each of the follow- 
ing lie? (a) 29.64; (b) .088; (c) 7,400; (d) 1.42; (e) 2,100; (/) 980.; 
(^) 4,216.; (h) .00067; (t) 400,000; 0)3.7914. 

2. How many significant figures arc there in each number in 
Exercise 1? 

3. What is the limit of the relative error in each number in 
Exercise 1? Carry the answers to two significant figures. 

4. Which of the following would probably cause systematic errors 
and which variable errors? (a) Use of a yardstick an inch too long; 
(6) allowing too little time for a standard test; (c) employing various 
teachers^ estimates of pupils^ weights; (d) assignment of marks by a 
teacher who is a severe marker; (e) measurement of short periods of 
time by use of a watch without a second hand; (/) determination of 
average class scores by averaging scores of five pupils chosen at 
random from each class. 

6. To how many more decimal places than the single cases may 
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the average of each of the following numbers of cases safely be carried? 
(o) 160; (b) 26; (c) 4,600; (d) 84; (e) 7,298. 

6. How many significant figures are probably accurate in 
each of the indicated products and quotients? (a) 5,000 -t- 21.5; 
(6) 220 X 14.78; (c) 1.65 X 82; (d) 31.648 ^ 5.69; (e) .0465 H- 70.; 
(J) 1.20 X 6.1. 

7. Write the product or quotient of each part of Exercise 6 only 
so far as it is probably accurate. 

8. Give, to four decimal places, the approximate limit of relative 
error of the product or quotient of each part of Exercise 6. 

9. What is the approximate limit of relative error of each indicated 

power or root, if each numb er has th e limi t of absolute error ind icated 
by its form? (a) V38.2S; (6) "'^4,000; (c) (.045)^; (d) >^^.4862; 
(e) (21)3; (eg.O)^. 

10. Write each of the following to only two decimal places: 
(a) .0971; (5) 25.963; (c) .6464; (d) 1.53497; (e) 418.305; (/) .6667. 
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Grouping Data 


Attributes and variables 

Data are frequently thought of as being either attributes or 
variables, but it is better to think of dealing with them by 
the method of attributes or by the method of variables. The 
former involves nonquantitative classification; the latter, 
quantitative. Some data are of such nature that no quantita- 
tive or numerical basis of grouping them is valid; hence they 
must be handled by the method of attributes. Most of the 
data concerned in educational statistics are of such nature that 
they can be, and usually are, handled by the method of vari- 
ables, but it is sometimes convenient to employ the method 
of attributes with them. 

The method of attributes, also called unordered, verbal, or 
qualitative classification, frequently involves grouping data into 
only two classes, but may use more. Common examples where 
only two classes are used are the classification of pupils as boys 
or girls, of teachers as men or women, of curricula as college 
preparatory or noncollege preparatory, and of activities as 
curricular or extracurricular. Examples involving more than 
two-fold classification are of color of hair as black, dark-brown, 
light-brown, or red; of curricular as general, commercial, 
agricultural, technical, or classical; and of major subjects as 
history, English, Latin, mathematics, home economics, science, 
and so on. In some of these instances numerical classes might 
be employed, but generally they would be of doubtful validity 
and not in accord with common practice. 

Among the data regularly dealt with as variables, or ordered, 
are school marks, test scores, heights, weights, ages, salaries, 

16 
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enrollments, and many others. As suggested, any of these can 
be put in the form of attributes, but usually their quantitative 
nature is still evident. Thus marks may be classified as passing 
or failing; pupils as being overweight, at weight, or under- 
weight; and so on. 

When the method of variables is used, the munber of groups 
or classes employed may range from very few up to a large 
number. In some instances the data are such that there 
cannot be more than a certain number of classes, but usually 
the number is determined by the judgment of the worker. The 
several chief considerations in such determination will be stated 
later in this chapter. 

Variables may be either continuous or discontinuous. A 
continuous variable is one capable of infinitesimal subdivision. 
There are no impossible values between the highest and lowest 
points on the scale of measurement. For example, height is 
continuous. There is no linear distance so small but that the 
difference in height between two individuals or between two 
measurements of the same individual may be still less. The 
only limitation in this respect is in our ability to measure it. 
Frequently it is neither practicable nor worth while to measure 
variables nearly so exactly as we can. One-five-millionth of an 
inch has been measured, but in determining the heights of 
children accuracy to the nearest inch is usually sufficient. It is 
especially true in the cases of such traits as mental age, quality 
of handwriting, and others for which we do not have very 
accurate means of measurement that it would be useless, even 
ridiculous, to attempt very exact determination. Such a 
datum as a mental age of 10 years, 6 months, 16 days or a 
handwriting score of 74.35 per cent is, in the present state of 
educational measurement, an absurdity. 

A discontinuous or discrete variable, on the other hand, is one 
that cannot be subdivided infinitesimally. Ordinarily the 
smallest possible difference in numbers of pupils in different 
rooms is one. If there are more than 33 pupils in a class, there 
must be at least 34.^ In many instances there are artificial 

^ In exceptional instances, such as when a pupil has part of his work in one 
class, part in another, fractional numbers of pupils may be legitimate. 
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conditions which cause variables that are intrinsically continu- 
ous to be employed as discrete. Thus in spelling no finer unit 
than the word is commonly recognized, yet pupils’ abilities 
may differ by only part of a word. If a system adopts a salary 
schedule with $50 increments, that amount becomes the least 
actual difference among salaries, although smaller ones are 
possible. 

It is very general practice to treat discontinuous data as if 
they were continuous. Usually so doing introduces some error ; 
but, if the least unit of difference practically possible is small in 
comparison with the data themselves, the error is likely to be 
negligible. 

Grouping or classifying data 

All educational data, similar to those in other fields, are 
fundamentally collections of single scores, cases, or other items. 
If the number is quite small, there is rarely need for grouping 
them. They may be dealt with as individual items so easily 
that it is scarcely worth while to group them. Moreover, it is 
probable that grouping would result in appreciable sacrifice of 
accuracy. 

Just how large the number of cases must be to justify group- 
ing cannot be stated exactly. Whenever a worker cannot 
easily keep in mind and employ in his computations the individ- 
ual items and can group them without serious loss of accuracy, 
he should do so. Thus a rural or .small-town administrator 
with five teachers in his system can easily deal with their 
salaries by listing them, preferably in order. If he moves to a 
system with twice as many teachers, he can still do so. If he 
moves on to one with twenty teachers, the task becomes so 
difficult that he may need to group them, and if he advances to 
one with forty or fifty teachers, he is exceptional if he can 
deal with the total salary situation readily without grouping 
them. 

To illustrate the fact just stated, the following salaries of the 
eighteen teachers in a certain high school may be used*: 1,400, 

*The dollar mark has been omitted here and elsewhere where its frequent 
repetition would be cumbersome. 
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1,250, 1,900, 1,600, 1,550, 1,750, 1,650, 1,350, 1,800, 1,600, 1,450, 

1.500. 1.700. 1.300, 1,450, 1,800, 1,600, 1,550. One can inspect 

these and easily determine the highest and lowest, but it is 
somewhat difficult to get and keep in mind an adequate con- 
ception of the whole group. If they are arranged in order, 
thus: 1,900, 1,800, 1,800, 1,750, 1,700, 1,650, 1,600, 1,600, 

1,600, 1,550, 1,550, 1,500, 1,450, 1,450, 1,400, 1,350, 1,300, 

1,250, the task becomes easier. If, however, the salaries of 
the thirty elementary teachers in the same system are added, 
the task becomes difficult, even if all are arranged in order of 
size. The entire forty-eight are: 1,900, 1,800, 1,800, 1,750, 
1,700, 1,650, 1,600, 1,600, 1,600, 1,550, 1,550, 1,500, 1,500, 

1,500, 1,500, 1,500, 1,450, 1,450, 1,450, 1,450, 1,450, 1,400, 

1,400, 1,400, 1,400, 1,350, 1,350, 1,350, 1,350, 1,350, 1,350, 

1.300, 1,300, 1,300, 1,250, 1,250, 1,250, 1,200, 1,200, 1,150, 


1,150, 1,100, 1,100, 1,050, 1,050, 1,000, 1,000, 1,000. 

This series of salaries may be grouped somewhat without 
any sacrifice of accuracy by Inerely tabulating the number 


of teachers who receive each amount. Such a 
tabulation is presented at the side. The column 
showing how many salaries of each amount there 
are is headed /, since each entry therein is called 
a frequency. N is the usual symbol employed for 
the total number of cases. 

Although the tabulation just given is distinctly 
easier to comprehend and employ in computation 
than the list of 48 scores, even when they are in 
order, yet it may be desirable to condense the scores 
still more. For most purposes they can be grouped 
into classes 100 in width without too great loss of 
accuracy. This has been done and the resulting 
tabulation given on page 20. In this the entries 
in the first column are not, as in the previous one, 


Salary f 

1900 1 

1800 2 

1750 1 

1700 1 

1650 1 

1600 3 

1550 2 

1500 5 

1450 5 

1400 4 

1350 6 

1300 3 

1250 3 

1200 2 

1150 2 

1100 2 

1050 2 

1000 _3 

AT = 48 


the exact salaries, but are the lower limits, abbrevi- 
ated I or sometimes ll, of the classes within which they fall. 


The dash after each indicates that the class extends all the 


distance up to, but not including, the next lower limit. For 
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example, the 975-class contains all cases from 975 up to 
but not including 1,075. In other words, its upper 
limit, abbreviated u or vl, is 1,074.99. 

The usual method of making such a tabulation 
from individual items is to determine what classes 
are to be used, list their lower limits, and record by 
tally marks, with a horizontal or diagonal line 
through each four to make a block of five, how 
many fall in each class. Figures are then substi- 
tuted for the marks.® Very few workers are so 
accurate but that, in order to insure correctness, 
they should tabulate a series of scores twice and secure agree- 
ment between the two tabulations before accepting the re- 
sult. 

There are several principles to follow in deciding what classes 
to use in a tabiilation. These principles are intended to provide 
for as great economy as possible in manipulation of data with- 
out serious sacrifice of exactness. Sometimes they conflict, 
thus rendering compromise or neglect of one principle neces- 
sary. 

As a preliminary to discussing these principles, some terms 
will be defined. A doss or group is one of the divisions into 
which the obtained range of values is divided according to size 
or, in the case of attributes, to some other characteristic. The 
lower limit of a class is the lowest possible value of any case that 
falls within it, and the upper limit is the highest. Since the 
upper limit of any class differs from the lower limit of the next 
class by an infinitesimal amount, that lower limit is generally 
substituted for the upper limit of the next lower class when 
the latter is employed in computation. Sometimes the term 
boundary is used instead of limit. 

The distance covered by a class, from its lower limit to the 
lower limit of the next class, is best called the doss interval or 
simply the interval. Sometimes doss width or just width is 


I f 

187&- 1 

1775- 2 

1675- 2 

1575- 4 

1475- 7 

1375- 9 

1275- 9 

1175- 5 

1075- 4 

975- _5 

AT = 48 


• In practice, it is frequently convenient to make the tally marks with a soft 
pencil, then write in the figures in ink, and finally erase the tally marks to leave 
a clean, neat tabulation. 
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employed. Frequently the word interval is also used as 
synonomous with doss. In addition, step is employed in both 
meanings. Usually the context makes clear which meaning 
should be attached to interval or step. Thus such expressions 
as “the 1200-interval,” “the 25-step,” and so on evidently 
mean the same as class, whereas “the interval is 100” and 
“the step is 10” refer to the width of classes, that is, to the 
distance between the limits. 

The usual recommendation as to the munber of classes in a 
distribution is that it be between ten and twenty. The use of 
less than ten is liable to result in too great loss of accuracy, 
whereas that of more than twenty does not permit ready 
comprehension of the whole distribution or sufficiently reduce 
the work of computation. Ten and twenty should not be 
regarded as absolute, however; sometimes less than ten or 
more than twenty classes may well be employed. Usually a 
large number of cases should be tabulated in more classes than 
a small number. 

Ease of manipulation increases if intervals are round, familiar 
numbers, such as 5, 10, 20, or 100, rather than such as 7, 11, 22, 
or 130. Moreover, tabulation is easier if intervals are so chosen 
that each lower limit differs from the adjacent ones in as many 
figures as possible. For example, an interval of 20, with 
accompanying lower limits of 0, 20, 40, 60, and so on, permits 
easier tabulation than one of 25, with lower limits of 0, 25, 50, 
75, and so on. In the former case, the digits in units’ place 
need not be watched at all; in the latter, they must be if the 
item falls in the 20’s, the 70’s, the 120’s, and so on. 

. The suggestion just given should not be followed when doing 
so would not produce frequency distributions that represent the 
original data with sufficient accuracy. Since the mid-point of 
a class is taken as the average value of the cases included in 
the class, intervals should be chosen with this in mind. The 
chief point to watch is that if there is a strong tendency for the 
original data to differ from one another by a uniform amount, 
the interval should be an integral multiple of this difference 
and the lower and upper limits should be so chosen that each 
falls halfway between clusters of cases. For example, in the 
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salary tabulation used before, differences are 50, or multiples 
thereof; hence the interval employed should be 60, 100, or 
some other integral multiple of 50 and the limits should end 
in 25 or 75. Since, as the data were tabulated, 975, 1,075, 
and so on are the lower limits, the mid-points are 1,025, 1,125, 
and so forth, and these are the averages of 1,000 and 1,050, 
1,100 and 1,160, and other values ending in 00 and 50, as they 
should be. 

If the smallest salary increment were 40 rather than 50, and 
larger differences were integral multiples of 40, then a class 
interval such as 80 should be employed, with limits 20 below 
the next highest or above the next lowest salary. For example, 
if the ten smallest salaries were 1,200, 1,200, 1,160, 1,160, 
1,120, 1,080, 1,080, 1,040, 1,000, and 1,000, and the total range 
such that an interval of 80 seems satisfactory, the lowest class 
should begin at 980, the next at 1,060, and so on. Thus the 
mid-points would be 1,020, 1,100, and so forth — in each in- 
stance the average of the two sizes of salary included in the 
class. 

When the principle just stated is not violated, lower and 
upper limits may well be selected so as to be integrally divisible 
by the interval. Thus, if the interval is 10, limits such as 10, 
20, 30, 40, and so on are generally preferable to 15, 25, 35, and 
so on; if the interval is 50, limits such as 200, 250, and 300 are 
usually better than those of 225, 275, and 325; and similarly 
in other cases. 

The class limits of a distribution should be so written as to 
show the accuracy of the measures tabulated. This is generally 
best accomplished by carrying them to as many decimal places 
as were used for the original measures. For example, such 
lower limits as 10., 20., 30., 40., and so on indicate that the 
original data were accurate to the nearest unit, whereas 10.00, 
20.00, 30.00, and so on indicate accuracy to the nearest 
hundredth. 

Sometimes distributions of data are such as to provide 
exceptional tabulation difficulties. For example, if in addi- 
tion to the salaries of the forty-eight teachers there is included 
also that of the superintendent, 3,750; that of the high-school 
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principal, 2,700; and those of two elementary prin- 
cipals at 2,100 and 2,000, to tabulate all in intervals 
of 100 would require twenty-eight intervals with 
lower limits ranging from 975 up to 3,675 and would 
make a lengthy distribution. On the other hand, 
to employ intervals of 200 or more would produce 
satisfactory length, but would condense the bulk of 
the data too much. The best solution is probably 
that shown at the side — to employ intervals of 100 
but not to insert them where several adjacent ones 
include no cases, that is, have frequencies of 0. 
Instead two or three dots, in vertical arrangement, 
are inserted to indicate where each omission oc- 
curs, thus resulting in a tabulation such as that 
shown. 


Salary f 
3675- 1 

2675- 1 

2075- 1 

1975- 1 

1875- 1 

1775- 2 

1675- 2 

1575- 4 

1475- 7 

1375- 9 

1275- 9 

1175- 5 

1075- 4 

975- _5 
N = 52 


Occasionally an even more exceptional set of data almost 
or absolutely requires the use of intervals of unequal width. 

Otherwise there is no possible way to avoid either 
1 000 000 - great condensation or very cumbersome length. 

’ 100 ^ 000 - General income statistics are an excellent example of 
M 0 ^ such a situation. If any reasonably small number of 
25 ^ 000 - equal intervals is used, the lowest one extends so 

20 , 000 - high as to include many more than 99 per cent of 
lo’ooo- fhe total number of cases. Therefore several widths 

5.000- of intervals are employed. A possible set of lower 
3 0 ^ limits is given at the left. In this case the higher 

2 . 000 - class has an indefinite upper limit. Another com- 
mon example in many states or other sections is 
high-school enrollments, the large majority of which 

are quite small but some of which extend into the thousands. 
In the frequency distributions so far presented only the 
lower limits are given. Some persons prefer to 
1776-1874 99 Upper limits as well, presumably for 

1675.-1774.99 the sake of clarity, but to the present writer there 
does not appear to be sufficient gain 
in so doing to warrant the practice. If it is done, 177^1375 
the limits of the highest three classes in the salary 1676-1775 
tabulation of the forty-eight teachers appear as at 
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the left. A somewhat similar practice that should never be 
followed is to write them as shown at the right. This form does 
not make clear where a salary of 1,775, for example, should be 
tabulated. 

Sometimes instead of using limits to define the classes, work- 
ers give their mid-points as illustrated here. Such mid- 
1^5 called doss marks, class types, doss indices, 

1726 or /ace values. The practice is statistically correct if 
properly understood, but both renders tabulation more 
difficult and is liable to misunderstanding. 

Before closing the discussion of tabulation and classification 
let us carry through an actual example. For this purpose the 
following set of total scores of a group of pupils upon a series 
of short tests may be employed: 214, 190, 194, 192, 267, 244, 
219, 228, 195, 274, 248, 368, 219, 244, 239, 192, 301, 261, 234, 
258, 215, 247, 236, 396, 307, 188, 242, 266, 232, 224, 257, 243, 
326, 221, 311, 200, 233, 300, 244, 342, 237, 372, 184, 331. 
Inspection of these scores reveals that the smallest is 184; the 
largest, 396; hence the range is 396 — 184 = 

212. Since they appear to be fairly well 
distributed over the whole range and no other 
complicating or exceptional condition is 
apparent, an interval of 20, which will make 
eleven classes, with limits into which it is in- 
tegrally divisible, seems satisfactory. Such 
limits are therefore set up and the scores 
tabulated by tally marks, as indicated. For 
the first score, 214, a tally is placed after 
200- ; for the next, 190, a tally is placed 
after 180- ; and similarly for all forty-four. This should then 
be repeated and the two sets of tallies compared to see if they 
agree. If so, they are probably correct and may be accepted. 
Figures should then be substituted for them, either at the side, 
as shown here, or wherever else may be convenient, and the 
tabulation is ready for use. 

Assumed location of grouped cases 

The assumed location of grouped cases is quite important in 


Score f 

380-1 1 

360-11 2 

340-1 1 

320-11 2 

300-1111 4 

280- 0 

260-1111 4 

240-1144 nil 9 

220-T«4 nil 9 

200-tH4 5 

180-1444.11 7 


iNT - 44 
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several connections. Since cases lose their original exact 
values when grouped in a distribution, some assumption as to 
their values must be made as a basis of many of the computa- 
tions and interpretations involving them. Although another 
assumption would often be more accurate, the one regularly 
made is that the cases in a class are symmetrically and equally 
distributed throughout that class. This means that each class 
interval is divided into as many equal parts or distances as there 
are cases in that class and one case assmned to be at the mid- 
point of each part. 

The assumption just stated may be illustrated by Figure 1, 

I I » ' I » ‘ I 1 -** -I » I 1 

300 320 340 360 380 400 

Fig. 1 . Graphic Representation of Assumed Distribution 
of Cases in a Grouped Distribution 


which represents graphically the five classes at the top of the 
distribution used previously, with the assumed position of each 
case shown. Since the class extending from 300 up to 320 
has a frequency of 4, it is divided into 4 equal parts, and at the 
mid-point of each is an x, to represent the case assumed to be 
there. Since one-fourth of 20, the interval, is 5, the first part 
extends from 300 to 305 and the case at its mid-point is at 
302.5. The second part extends from 305 to 310, so the second 
case is at 307.5, and similarly the others are 312.5 and 317.5. 
The 320-class contains 2 cases; hence it is divided into 2 parts 
covering from 320 to 330 and 330 to 340, respectively, and the 
cases are at their mid-points, 325 and 335. The 340-class has 
but 1 case, which therefore is at its mid-point, 350. The 
2 cases in the 360-class are at 365 and 375, and the 1 in the 
380-class is at 390. 

In the instances just presented it is easy to determine the 
assumed value of each case, but when the class frequency is 
large it is difficult to do so by mere inspection. The values 
found in the last paragraph show that the first case in a class 


is always ^ of the interval above the lower limit and that each 
c 4 / 
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successive case is y of the interval larger than the case Just below 

it. In formula form, the assumed value of any case in a class is 

I + - j - '- which may also be written I + lo this I 

is the lower limit of the class involved, / is its frequency, i is 
the interval, and n is the ordinal number, or position within 
the class of the case the value of which is desired. For the 
value of the second case in the 300-class this formula gives 


2 — 5 

300 4- - ■ ■ 20 = 307.5, the result already secured by 

inspection. As another example, the assumed value of the 
third case in the 220-class of the distribution of scores is 


230 -b - - Q-- 20 = 225.56. 

For the first case in a class it is convenient to use a simplified 
form of the formula. Substituting 1 for n gives 


1 + 


2x1-1 

2 / 



as the value of the first case. For example, the first case in 

20 

the 200-class is 200 + Jq = 202. Similarly the formula for the 


last case is u — u being the symbol for upper limit. Thus 


20 


the last case in the 240-class, for example, is 260 “ Js == 258.89. 


It is also convenient to remember that if the frequency in a class 
is an odd number the middle case always comes at the mid- 
point of the class. For example, the fifth or middle one of the 
9 cases in the 220-class is 230. 


Meaning of numerical data 

The numerical expressions of scores and other data do not 
always have the same meaning. There are two chief interpre- 
tations. In some — ^probably most — cases, a score or other 
datum of a given value means at least that much, but not so 
much as the next higher value. For example, a score of 46 
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on a spelling test regularly means 46 words spelled entirely 
correctly, with perhaps some others partially right, but not 
47 correct. The same is true for scores on almost all varieties 
of short-answer tests, both standard and informal. 

The other common but probably less frequent type is that 
in which a score means nearer the given value than to the next 
one below or above it. For example, the directions for scoring 
specimens of pupils’ handwriting by the Ayres Handwriting 
Scale, which has samples at intervals of 10 from 20 to 90, 
provide that each be given the value of the sample which it 
most resembles. In other words, specimens rated between 
25 and 35 are given 30, those between 35 and 45 are given 40, 
and so on. The same interpretation is usually given to per-cent 
marks on essay examinations, to intelligence quotients, and to 
various other types of data. 

In the cases of some varieties of data there is no general 
agreement in practice as to which is the meaning of the data. 
For example, an age of 11 may mean 11 but not 12 years, or 
it may mean nearer to 11 than to either 10 or 12. 

It is important that whoever is concerned with a set of data 
know which interpretations and results are appropriate for 
them. Misinterpretations and consequent errors frequently 
result because workers fail to make clear which type of data 
they are employing. Often the source of confusion has been 
failure to give class limits n such a way as to indicate the 
actual meaning of the data. If they are of the second type, 
such that a given score means nearer to the stated value than 
to those just below and above it, the class limits should be 
stated so as to make this fact evident. Thus for scores on the 
Ayres Handwriting Scale lower 1 mits of 15, 25, 35, and so on, 
rather than of 20, 30, 40 and so forth, should be employed. 
Similarly for ordinary per-cent marks, since one of 60 means 
from 59.5 to 60.5, one of 61 from 60.5 to 61.5, and so forth, 
lower limits for intervals of 5 should be 59.5, 64.5, 69.5, and 
so on, rather than 60, 65, 70, and so on. 1 

The safest practice to follow in interpreting data is to take 
the given limits as the true limits unless it is definitely known 
that they are not. In other words, mid-points should regularly 



28 


GROUPING DATA 


be taken halfway from one lower limit, as given, to the next, 
unless it is certain that they should be taken otherwise. 


Cumulative frequency distributions 

The type of frequency tabulation or distribution described 
and employed above is the most usual variety employed. It is, 
however, sometimes helpful to make use of cumulative distribu- 
tions. The entries or frequencies in such distributions show 
the ntunber of cases in and below, or in and above, each class 
instead of just the number in it. Such a tabulation is com- 
monly made from an ordinary one of the kind presented above. 
The frequencies therein are sxunmed continuously from one 
end or the other, but more often from the lower end, to give 
the cumulative frequencies. Such a tabulation is illustrated in 
Table I. The first two columns merely repeat the distribu- 
tion found on p. 24. The third contains the cumulative fre- 

TABLE I 

Cumulative Frequency Distribution, 

Cumulated prom the Bottom Up 


Score / 

cumf 

cum % 

380- 1 

44 

100.0 

360- 2 

43 

97.7 

340- 1 

41 

93.2 

320- 2 

40 

90.9 

300- 4 

38 

86.4 

280- 0 

34 

77.3 

260- 4 

34 

77.3 

240- 9 

30 

68.2 

220- 9 

21 

47.7 

200- 5 

12 

27.3 

180- 7 

7 

15.9 

11 




quencies obtained by successively adding the ordinary frequen- 
cies from the bottom up. The cumulative frequency in the 
lowest class is always the same as its ordinary frequency, in 
this instance 7. That in the next or 200-class is 12, the sum 
of its frequency, 6, and the cumulative frequency, 7, of the 
class below. The cumulative frequency in the 220-class is 21, 
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which is its frequency, 9, plus 12, the cumulative frequency of 
the 200-class. Similarly the others are found, finally that of 
the highest class being equal to N, here 44. 

The last column is not necessary, but is often useful. It 
contains the ciunulative frequencies expressed as per cents of 
the total number of cases. In other words, each entry is the 
corresponding cumulative frequency divided by 44. It may 

also be obtained by finding ^ and multiplying by the cumu- 


lative frequency. 


Here 


100 100 
iV “ 44 


2.273. Seven times this 


= 15.9, the first cumulative per cent; 12 X 2.273 = 27.3, the 
next, and so on. 

In interpreting such a cumulative table readers should note 
that since the cumulative frequency for each class represents 
the number of cases in and below that class, it signifies that 
there are that many cases below the lower limit of the next 
higher class. Thus the cumulative frequency of 7 for the 
180-class indicates that 7 cases are below 200, that of 12 for the 
200-class that 12 are below 220, that of 21 for the 220-class that 
21 are below 240, and so on. The same interpretation, of 
course, applies to the cumulative per cents as well as to the 
frequencies. 


TABLE II 

Cumulative Frequency Distribution, 
Cumulated from the Top Down 


Score / 

cu7n f 

380- 1 

1 

360- 2 

3 

340- 1 

4 

320- 2 

6 

300- 4 

10 

280- 0 

10 

260- 4 

14 

240- 9 

23 

220- 9 

32 

200- 5 

37 

180- 7 

44 

AT* 44 
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Although frequencies are cumulated from the top down much 
less often than from the bottom up, the procedure is illustrated 
in Table II. The cumulative frequency of the highest class is 
1, the same as its ordinary frequency. That of the next highest 
is 3, from 1 + 2; that of the next is 4, from 3 + 1; and so on. 
Finally that of the lowest class is equal to AT, 44. In this case 
each cumulative frequency indicates the number of cases above 
the lower limit of its class. Thus there is 1 case above 380; 
there are 3 above 360, 4 above 340, 6 above 320, and so on. 
Cumulative per cents may also be found here, but they have 
not been computed and included in the table. 


EXERCISES AND PROBLEMS 


1. Which of the following are usually best dealt with as attributes 
and which as variables? 


(а) Places of birth 

(б) Grade enrollments 

(c) Amounts of tardiness 

(d) Subjects failed by pu- 

pils 

(e) Colleges attended 


(f) Ratings of textbooks 
(gi) Professional training of 
teachers 

(h) Nationalities of pupils 

(i) Quality of handwriting 

(j) Causes of absence 


2. Tabulate the following ages in a frequency distribution: 

15 - 6 , 16 - 1 , 14 - 11 , 17 - 2 , 14 - 8 , 15 - 10 , 16 - 4 , 13 - 9 , 14 - 4 , 17 - 5 , 14 - 10 , 16 - 3 , 
17 - 9 , 18 - 4 , 16 - 9 , 14 - 10 , 16 - 6 , 17 - 0 , 16 - 7 , 15 - 5 , 17 - 1 , 18 - 7 , 14 - 6 , 19 - 0 , 
14 - 2 , 15 - 8 , 16 - 7 , 17 - 4 , 17 - 6 , 17 - 8 , 14 - 9 , 15 - 1 , 17 - 8 , 16 - 10 , 13 - 5 , 15 - 10 , 

14 - 7 , 16 - 4 , 18 - 2 , 15 - 11 , 19 - 3 , 14 - 8 , 16 - 0 , 17 - 4 , 15 - 3 , 17 - 0 , 15 - 9 , 16 - 2 , 

15 - 1 , 15 - 11 , 16 - 8 , 18 - 4 . 

3. Tabulate the following school enrollments in a frequency dis- 
tribution: 

71 , 18 , 243 , 95 , 51 , 26 , 76 , 141 , 428 , 48 , 36 , 78 , 122 , 341 , 50 , 16 , 65 , 108 , 
254 , 29 , 94 , 35 , 1 ^., 33 , 66 , 209 , 37 , 82 , 8 , 136 , 81 , 841 , 59 , 30 , 74 , 76 , 34 , 40 , 
70 , 181 , 21 , 389 , 172 , 13 , 88 , 37 , 62 , 94 , 100 , 11 , 42 , 31 , 64 , 383 , 55 , 41 , 17 , 
29 , 36 , 23 , 75 , 162 , 198 , 68 , 44 , 21 , 71 , 40 , 86 , 184 , 645 . 

4. Tabulate the following ratings of school buildings in a frequency 
tabulation: 

863 , 794 , 623 , 852 , 921 , 781 , 724 , 605 , 766 , 841 , 702 , 675 , 773 , 841 , 809 , 
730 , 934 , 622 , 556 , 687 , 752 , 834 , 764 , 825 , 677 , 842 , 738 , 872 , 696 , 571 , 
832 , 768 , 718 , 864 , 758 , 660 , 683 , 819 , 736 , 628 , 904 . 

6 . Tabulate the following intelligence quotients in a frequency 
tabulation: 

88 , 102 , 95 , 79 , 106 , 89 , 97 , 99 , 124 , 110 , 85 , 98 , 106 , 92 , 113 , 83 , 94 , 99 , 
108 , 76 , 93 , 145 , 104 , 95 , 101 , 80 , 100 , 73 , 97 , 109 , 125 , 94 , 116 , 87 , 93 , 
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107, 104, 116, 93, 81, 114, 95, 87, 104, 126, 94, 109, 103, 99, 107, 109, 87, 

94, 90, 106, 119, 127, 107, 103, 74, 88, 93, 77, 104, 94, 109, 98, 103, 117, 86, 

93, 97,91, 106, 115, 158, 93, 88, 104. 

6 . Make the cumulative frequency distributions, both up and 
down, of the data in Exercises 2, 3, 4, and 5. Also find the cumulative 
per cents for each. 

7 . Find the assumed value of each indicated case in the accom- 
panying distribution. 

(а) The first case in the 50-class. 

(б) The second case in the 65-class. 

(c) The seventh case in the 60-class. 

(d) The one case in the 45-class. 

(e) The highest case in the 70-class. 


Score f 
75- 2 
70- 4 
65- 5 
60- 9 
55- 8 
50- 2 
45- 1 
40-_l 
N=32 


8 . Find the assumed value of each indicated case in the accom- 
panying distribution. Score f 


(а) The third case in the 500-cIass. 

(б) The lowest case in the 750-class. 

(c) The eighth case in the 550-cIass. 

(d) The fifth case in the 650-class, 
(c) The fourth case in the 400-class. 


850- 1 
800- 0 
750- 6 
700-12 
650-18 
600-20 
550-15 
500- 7 
450- 2 
400-_5 
AT ='86 
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CHAPTER III 


Graphic Representation of Frequency 
Distributions 


The coordinate axis system and frequency curves 

Although most of the graphic representations of frequency 
distributions belong in only one quadrant — ^the first — of the 
coordinate axis system, the worker in this field should have at 
least an elementary understanding of this system, so widely 
rised in mathematical work. Figure 2 illustrates it. It con- 
sists of two perpendicular lines, the horizontal one of which 
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is known as the X-axis and the vertical one as the Y-axis. 
Their point of intersection is the origin, abbreviated 0, and is 
the zero point from which horizontal and vertical distances are 
measured. Horizontal or X distances to the right of the origin 
are positive, those to the left negative. Vertical or Y distances 
above the origin are positive, those below it negative. 

The four divisions of space formed by the two axes are known 
as quadrants. They are numbered ordinally as shown on 
Figure 2. The + and — marks in them indicate the signs 
of points located in them, the first mark being the sign of their 
X distance and the second of their Y distance. Thus in the. 
first quadrant both X and Y distances or values are positive, 
in the second X is negative and Y positive, and so on. 

The most frequent use of the axis system is to describe points 
by their location. This is done by giving two figures. The 
first of these, called the abscissa, shows the X distance, and the 
second, called the ordinate, shows the Y distance. For exam- 
ple, the location of point A Is commonly given thus: 3,2, 
which means that it is 3 units to the right of the F-axis and 

2 units above the A-axis. Similarly point B (4,-1) is 4 units 
to the right and 1 below; C (—5, 3) is 5 units to the left and 

3 above, and D (—4, —6) is 4 units to the left and 6 below. 

When, as is very conunon, the two variables to be repre- 
sented graphically are the amount of a trait or characteristic 
and the frequency in each class thereof, regular practice is to 
use the X or horizontal direction to represent the amount of the 
characteristic and the Y or vertical direction for the frequencies. 
Since there are rarely cases with less than none of the charac- 
teristic concerned and never frequencies below 0, both variables 
regularly have positive values and the graph representing them 
lies in the first quadrant. 

To represent a frequency distribution graphically, one 
should construct horizontal and vertical scales divided into 
units of such sizes as make its significant features readily 
apparent. In general the width and height of the graph may 
well be about equal. The scale for the amount of the charac- 
teristic should begin about one interval below the lowest case 
and^ extend about one above the highest. If its lower end is 
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not at 0, there should be a short broken line extended to the 
0 point. The vertical or frequency scale, ordinarily erected 
at the 0 point, should be as high as needed, or slightly higher, 
to represent the largest frequency. Both scales should be 
clearly labelled. These and other desirable features of graphs 
will be illustrated in the next few figures. The three common 
forms of the ordinary frequency curve or graph — ^the histogram 
or column diagram, the polygon, and the smooth frequency 
curve — ^will be presented in the following sections. 


The histogram or column diagram 

The histogram or column diagram is composed of a series of 
rectangles, each one interval in width and of proper height to 
correspond to the frequency of the interval it represents. The 
lines dividing one rectangle from another are generally omitted, 
but may be drawn in to emphasize the division into classes. 
Figure 3 contains such a graph. Its height above each class is 


number 



Fig. 3. Histogram or Column Diagram Repraienting the 
Data Shown at the Right 
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that indicated by the corresponding number of cases, 7 above 
the 180-class, 5 above the 200-class, and so on. 

The easiest way to construct a histogram, after choosing and 
laying off the two scales in convenient dimensions, is to draw 
all the lines running in one direction and then all those in the 
other. Which set is drawn first is immaterial. 

In rare instances histograms are presented with axes reversed, 
so that the scale for whatever is measured appears vertically 
and that for the frequencies horizontally. An 
adaptation of this that is convenient in the 
preparation of typed copy is the use of as many 
a;’s, o’s, or other symbols as there are cases in 
each class. The result of this for the same 
distribution represented by the histogram is 
shown at the right. 

An advantage of the histogram over both 
the frequency polygon and the smooth fre- 
quency curve is that it can rea'dily be employed 
to show individual cases. For this purpose the rectangle above 
each class interval is divided into as many sections, one above 
the other and usually square, as there are cases therein. 
Either one of two facts pertaining to each case, or both, may 
be shown. The unit sections may be labelled by number, letter, 
or otherwise so that the one representing each case can be identi- 
fied,* or the exact value of each case may be inserted, or both. 

Figure 4 illustrates such a use of a histogram. It represents 
the scores of 32 pupils upon a 200-element test. The exact 
scores are shown, with those in each class arranged in order 
from lowest up to highest. Thus one pupil made 102, one 105, 
and so on up to one who scored 185. If each pupil has been 
informed as to his score, there is no need for using identifying 
labels. 

In such a graph no scale of frequencies is necessary, since 
the cases can readily be counted. If, as here, the actual scores 
are given, the base-line scale does not need to be labelled. If 
they are not, it should be, just as for an ordinary histogram. 

^ When pupils^ scores are presented thus, it is usually best to label them so that 
only the pupil concerned and the teacher knows whose score each is. 


380-x 

360-xx 

340-x 

320-xx 

300-xxxx 

280- 

260-xxxx 

240-xxxxxxxxx 

220-xxxxxxxxx 

200-xxxxx 

180-xxxxxxx 
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143 
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161 
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181 


Fig. 4. Hiitogram Divided Into Unit Sections, Each 
Representing an Individual Case 

The frequency polygon 

The frequency 'polygon is an angular curve formed by connect- 
ing with straight lines a series of points, one above the mid- 
point of each interval at a height corresponding to the frequency 
therein. At each extreme a Une is drawn from the point above 
the last class to the base line at the mid-point of the next class 
below at the bottom, and above at the top, of the distribution. 

Figure 5 illustrates this type of graph. The line bounding it 
begins on the base line at 170, the mid-point of the class just 
below the lowest class in the distribution. From that point it 
is drawn to another at a height of 7 above 190, the mid-point 
of the 180-class; thence to one at 5 above 210, the mid-point 
of the 200-class; thence up to a height of 9; and so on until it 
ends on the base hne at 410, the mid-point of the next class 
above the last one. Since there are no cases in the 280-class, 
it comes down to and rises from the base line at 290, the mid- 
point of that class. 

A frequency polygon does not include above the base-line 
distance representing each class an area that exactly corre- 
sponds to the frequency of that class. For this reason it does 
not show the frequency in each class so readily as the histogram 
does. Its total area, however, is equal to that of the histogram, 
and it so distributes that area as probably to be more nearly in 
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NUMBER 

OF 

CASES 



FI 3 . 5. Frequency Polyson Representins the Same Data 
as the Histosram in Figure 3 

accord with the true distribution of the individual data than the 
histogram is. This is even more likely to be true if the actual 
data employed are a sample of a large number which the graph 
is to represent. 

The smooth frequency curve 

The smooth frequency curve is the most graceful and artistic 
form of the three varieties, but it is also the most difficult to 
draw well. It is less accurate than the frequency polygon in 
portraying the nvunber of cases in each class, but more likely to 
approach the true shape of the distribution of single data or of 
a larger number from which a sample was selected. Such a 
curve for the same data as used for the others is shown in 
Figure 6. 

To draw such a curve, points are located just as for a fre- 
quency polygon and a smoothly flowing curved line is drawn 
through them instead of a series of straight lines. 
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NUMBER 

OF 

CASES 
IOt 

8 

6 


4- 

2 - 

oj — - 
0 160 

TOTAL SCORES 

Fig. 6. Smooth Frequency Curve Representing the Same Data 
as the Graphs in Figures 3 and 5 

Smoothing 

The method of smoothing, also called that of graduating or of 
moving or rolling averages, is sometimes applied to a distribution, 
or the curve that represents it. Its purpose is to eliminate 
minor fluctuations due to small frequencies, irregular move- 
ments in time series, poor sampling, short time cycles, or other 
causes of unrepresentative sampling or variable errors. For 
example, if 50 cases are taken as a sample from 1,000, the 
fraction of the 50 falling in each class will probably vary 
somewhat erratically from the corresponding fraction of the 
whole 1,000. If the sample is properly smoothed, the result 
will probably more nearly represent the whole 1,000 than if it 
is not. Along with its desirable effect, however, smoothing 
may reduce or even eliminate features of the distribution that 
are actual characteristics of the larger group of data and should 
be retained. Therefore it should be employed with caution. 
If one is in doubt, he probably should not smooth data. 
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The smoothing of a set of data with short time cycles or 
seasonal movements so as to reduce or eliminate them and thus 
make the long-time trend more evident is often desirable. For 
example, if we wish to study the increase in spelling ability 
from year to year and have monthly scores as our data, we 
can eliminate much of the variation from month to month and 
emphasize the general trend by proper smoothing. 

The general principle of smoothing is to replace each fre- 
quency or value by a smoothed frequency obtained by averag- 
ing several that center at the class in question. An odd number 
of frequencies, most often three, is generally taken to be 
averaged. Sometimes the middle one is given double weight 
in determining the smoothed frequency. So doing reduces the 
amount of change from the original frequencies. The chief 
situation in which even numbers of frequenc’es are averaged is 
in connection with time cycles of even numbers of units. Thus 
to eliminate monthly cycles, twelve frequencies may be aver- 
aged. Since, however, it is more convenient to employ an odd 
number, thirteen may be used. 

Smoothed frequency curves may be drawn in any of the three 
forms already presented. If a smoothed curve is to be com- 
pared with the curve of the original data, it should be drawn 
similarly to the latter. 

To illustrate smoothing by threes, the usual method. Figure 7 
presents the same data already employed, the smoothed dis- 
tribution by threes, and the frequency polygons representing 
both. The solid line is the original polygon, the same as in 
Figure 5, and the broken line the smoothed one. 

The first smoothed frequency, 6.33, is obtained by doubling 
the first frequency, 7, adding the next one, 5, and dividing by 3. 
2X74-5 

In other words, s = 6.33. The next is the average of 


the frequencies for the three successive classes, of which it 

7 5 _l_ 9 

represents the middle one. Hence it is s = 7 .00, and 


so on for the others. The last, as the first, requires doubling 


the last original frequency, hence is 


2 4-2X1 


1.33. 
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It IS evident that the smoothed distribution and curve are 
distinctly less uneven, that is, have smaller changes from class 
to class, than do the original ones. If a larger number of 
classes is averaged for each smoothed frequency the smoothing 


NUMBER 

OF 

CASES 



TOTAL SCORES 


Fis. 7. Original and Smoothed Frequency Polygont 

or levelling effect is still greater. Occasionally more than one 
smoothing is applied to the same data, which also results in 
greater reduction of differences from class to class. 

It is possible to accomplish the same result by geometric or 
graphic means, that is, by drawing the original curve and then 
constructing the proper lines. The procedure is rarely em- 
ployed, however, since for most persons it is more difficult than 
the method given. 


Cumulative frequency curves 

Cumulative distributions may be represented by curves just 
as may ordinary distributions. Any of the three forms may 
be used, but in practice the polygon or angular form is almost 
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never employed, the rectangular or “stair-step” form is some- 
what rare, and the smooth form decidedly the most frequent. 
Therefore the examples given will be in that form. Just as is 
the case for cumulative distributions, so cumulative curves or 
graphs may either be started from the bottom up, thus showing 
the number of cases below any specified point, or from the top 
down, showing the number above. The former is more 
common. 


NUMBER 

OF 

CASES 



Fig. 8. Cumulative Frequency Curve Showing Number of 
Cases Below Each Point 


Figure 8 presents a smooth curve cumulated from the bottom 
up for the same set of data already frequently employed in this 
volume. It leaves the base line at 180, the lower limit of the 
first class; rises to a height of 7 at 200, the lower limit of the 
second class; to 12 at 220; and so on .up to 44 at 400. Thus 
it shows that no cases are below 180, 7 are below 200, 12 below 
220, and so on. 

Readers will note that the horizontal or score scale is the 
same in this figure as in the several others representing the 
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same data, but that the vertical or number of cases scale 
differs. Here it must provide for as many cases as are in the 
whole distribution, whereas in the others it needs to provide for 
only so many as the greatest frequency in any one class. 

Since data are ciunulated from the top down much less fre- 
quently than from the bottom up, no graph to illustrate such 
a cumulation is given. One for the same data just used would 
have scales the same as that in Figure 8. The curve would 
begin at a height of 44 above 180, be 37 high above 200, 32 high 
above 220, and finally reach the base line at 400. If it were 
plotted on Figure 8, the two curves would intersect directly 
above the median score, that is, the score or point on each side 
of which half of the scores lie. 

Ogives or percentile curves 

There is some variation in practice in the use of the terms 
cumulative curve, ogive, and percentile curve. Some statisticians 
use them interchangeably, to refer to such a curve as that illus- 
trated !n Figure 8 and also to the same tj'pe of curve drawn 
with the two scales interchanged. Others employ cumulative 
curve as referring only to the type in Figure 8, and ogive and 
percentile curve for the variety with scales interchanged. The 
present writer favors this latter usage, hence will employ the 
two latter terms to refer to a graph representing cumulative 
frequencies with the trait measured shown on the vertical scale 
and the number of cases on the horizontal. 

Ogives may be drawn for distributions cumulated in either 
direction, but their use for those cumulated from the top down 
is so rare as to be negligible. Moreover, the definition of a 
percentile is that it is a point with a certain per cent of the total 
number of cases below it, hence the term percentile curve or 
percentile graph can properly be applied only when the cumula- 
tion has been from the bottom up. Also, ogives may be drawn 
in any one of the three geometric forms of ordinary curves, but 
are usually smooth, occasionally rectangular, almost never 
angular. 

Figme 9 presents the ogive for the same data already used. 
It shows just the same facts as does Figure 8, but with the 
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scales reversed. In addition to the scale showing the number 
of cases, a percentile scale is also laid out horizontally. It 
indicates where each tenth percentile comes, with vertical lines 
drawn up from each such point on the scale, as is frequently 
done. Thus there is a vertical line above the 10th percentile, 
which is the point below which 10 per cent of 44 or 4.4 cases lie; 

TOTAL 

SCORES 



Fis. 9. Osive of Same Data Used in Fisure 8 and 
Previous Fisures 


another above the 20th percentile or the point below which 
8.8 cases lie; and so on. Such vertical lines, by their inter- 
sections with the curve, assist in the determination of the score 
at which each percentile falls. Thus the 10th percentile line 
intersects the curve at a point even with approximately 192 
or 193 on the vertical or score scale; the 20th percentile inter- 
sects it at a point corresponding to a score of about 207; the 
30th at one corresponding to about 223; and so on. In other 
words, 10 per cent of the cases are below 192 or 193, 20 per cent 
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below about 207; and similarly for other percentiles. Some>- 
times horizontal lines are drawn to assist in determining at 
what score the percentiles fall, but this has not been done here. 

In the example just given, each tenth percentile is shown, but 
there is no unanimity of practice in this respect. Sometimes 
fewer, and sometimes more, are shown. The 25th, 50th, and 
75th are perhaps the most commonly indicated. In graphing 
test norms, a situation in which this type of curve is often 
employed, seven percentiles — the 5th, 10th, 25th, 60th, 75th, 
90th, and 95th — are frequently shown. 

In order to illustrate further the use of the ogive with test 
scores, Figure 10 has been prepared and inserted. It illustrates 


SCORE 



Fig. 10. Ogive llluibating Comparison of Scores of a Group 
with Norms for Data in Table III 


the comparison of scores made by a class of pupils with general 
norms. The solid line represents the norms, the broken one 
the scores of the class, as given in Table III. The actual num- 
bers of cases are not shown on the figure. Since they differ, 
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TABLE III 

Scores Serving as Basis of Norms 
AND Those from a Single Class 


Score 

Norms 

Class 

/ 

earn f 

/ 

cum f 

70- 

2 

2372 



65’ 

4 

2370 



60- 

10 

2366 



55- 

19 

2356 

1 

34 

50- 

56 

2337 

0 

33 

45- 

78 

2281 

0 

33 

40- 

171 

2203 

2 

33 

35- 

2U 

2032 

5 

31 

30- 

325 

1791 

7 

26 

25- 

43S 

1466 

11 

19 

20- 

413 

1028 

6 

8 

15- 

328 

585 

2 

2 

10- 

177 

257 



5- 

71 

80 



0- 

9 

9 



N - 2372 

• 

34 



both should be there if either is and their presence would tend 
to complicate the graph unnecessarily. Instead, as is usually 
preferable when distributions containing different numbers of 
cases are represented on the same graph for comparisons such 
as this, per cents are used. 

The positions of the two curves indicate not only the two 
distributions of scores, but also their comparative positions. 
Since at the 50th percentile the curve for the class is above that 
of the general norms, it is evident that on the whole the class 
scored high. Indeed, the whole lower 70 per cent of the class 
rank above the corresponding portion of pupils in general. 
The upper 30 per cent, however, are mostly below the norms. 
At both extremes the norms extend considerably farther than 
the class scores, thus indicating that the class is more homo- 
geneous in achievement than are pupils in general. 

The normal frequency curve 

Because approximations to the nonnal frequency curve result 
so often when biological — ^including educational — and many 
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other kinds of data are tabxilated and graphed, the normal 
frequency curve is of high importance. Many terms are ap- 
plied to it, among them being normal, normal frequency, prob- 
ability, error, biologic, Gaussian, and Laplacean. It is sym- 
metrical, bell-shaped, high at the center, decreasing in height 
fairly rapidly near the center, then more slowly, finally reaching 
the base line at an infinite distance from its center. 

Figure 11 presents two normal frequency curves, drawn on 
the same horizontal scale but different vertical scales. Both 



are shown to illustrate the fact that the scale may be chosen, 
according to the judgment of the person who constructs the 
curve, without robbing the curve of its normality. Each has 
been drawn to the points where it approaches the base line 
closely enough so that the space between can scarcely be shown. 

The fact that approximations to normal distributions occur 
so frequently may lead workers to assume that almost every 
set of data should take such a shape and that if one does not 
something is wrong. In many instances this is true and failure 
to approximate normality is due to inadequate or nonrepresen- 
tative sampling, to presence of errors, or to use of a scale of 
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measurement with too crude units, but in others the true shapes 
of the distributions are not normal. 

Not only do scores on most tests adapted in difficulty to the 
group tested, intelligence quotients of the general population, 
and many other varieties of data, approach normality, but the 
same is also true of variable errors, of matters of chance with 
equal odds, and of many other situations. The expansion of 
the binomial with the two terms equal yiejds an approximately 
normal distribution. The higher the power, the closer the 
approximation. Therefore such a matter of even chances as 
numbers of heads and tails resulting from tossing coins, for 
example, yields a distribution very nearly normal. 

Although no use of the formula for the normal curve is made 
in this volume, it seems appropriate to give it. It is 

-x» 

y = j/oe^'S 

in which y is any ordinate; that is, its height at any point; 
yo the maximum ordinate or height at the center; e = 2.7183, 
the base qf the Napierian system of logarithms; a; the distance 
from the mean of X; and v the standard deviation. Since 

N N 

“ WUr 2.5066(r’ 

the formula may also be written 

N ^ 

^ 2.5066(7 

There are several methods of constructing normal curves. 
One is to plot the expansion of the binomial with terms equal 
to a fairly high power, at least the sixth, and draw a smooth 
curve through the points so obtained. A curve so constructed 
will not be exactly normal, but nearly enough so for most 
purposes. 

The best and easiest method is to make use of the data found 
in tables of the curve, such as those in the Appendix. The 
first step is to decide on the dimensions of the graph, that is, 
how high it is to be at the center and how wide at the base. For 
example, we may wish to make it 6 units high and 12 units 
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wide. Therefore the two scales, vertical and horizontal, are 
laid off with these dimensions. Since, as has already b^n 
stated, the curve extends horizontally to infinity, a decision 
must made as to how far to extend it in the graph. For one 
of the size here employed, a distance of 3 standard deviations, 
or 3<r, in each direction from the center or maximum ordinate 
is far enough; hence twice that, or 6<r, is taken as equal to 
12 units, with the result that l<r equals 2 units. 

After these decisions have been made and the scales plotted 
as shown in Figure 12, the next step is to locate points through 

HEIGHT 



Fig. 1 2. Normal Frequency Curve Conifaucted by Uie of 
Data in Appendix 


which the curve may be drawn. One such point will be at the 
top of the maximum ordinate, or 6 units above the center of 
the base line. Others are found by taking a series of base-line 
distances from the maximum ordinate and finding the height 
corresponding to each. For rough work, distances at .5<r inter- 
val are sufficient; for more exact work, they should be closer 
together. In this instance an interval of .25<r will be used. 
Thus the first distance from yo to be used is .25<r. It is located 
in the first column of the standard deviation table in the 
Appendix and the corresponding entry in the second or height 
column found. This is .9692; therefore at a distance of .25cr, 
which here equals .6 unit, on each side of the center a point is 
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placed .9692 as high as the maximum ordinate or .9692 X 6., 
which equals 5.82, units high. For .5<r the corresponding 
height entry is .8825; hence at .5(r or 1.0 unit from the center 
in each direction will be a point at a height of .8825 X 6., or 
5.30, units. Similarly at .75(r or 1.5 units distance will be 
points .7548 X 6., or 4.53, units high; at 1.00<r or 2.0 units 
distance will be points .6065 X 6., or 3.64, units high; and so 
on until at 3.00- or 6.0 units distance will be points .0111 X 6., 
or .07, units high. A smooth curve is then passed through 
these points, with the result shown in Figure 12. 

Probably most normal curves which readers have occasion 
to construct will not be, as here, merely abstract, but will be 
those most nearly fitting actual sets of data. The construction 
of such best-fitting normal curves will be dealt with later, in 
Chapter XIII. 

Other symmetrical curves 

All curves of which the halves to the right and left of the 
center are alike are called symmetrical or isokurtic. Sym- 
metrical curves which also have the general characteristic of 
being highest at the center and decreasing in height toward the 
extremes are grouped into three classes on the basis of the 
degree of bunching or grouping around the center. This trait 
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Fij. 1 3 , Lcptokurtic and Platykurtic Curvet 


is called kurtosis. The normal curve, in this connection termed 
mesokurtic, is considered to have average kurtosis or bunching. 
Curves that are flatter or less bunched near the center are 
known as platykurtic, those steeper or more bunched as lepto- 
kurtic. Figure 13 illustrates these two types. 

Because of the variation in appearance due to choice of 
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different scales, moderately platykurtic and leptokurtic curves 
are often not distinguishable from mesokurtic ones by mere 
visual inspection. A formula for identifying them will be 
given later, in Chapter VI. 

A rare type of symmetrical curve, more often encountered 
elsewhere than in educational work, is the U-curve, so called 
because its shape resembles that letter. In other words, it is 
highest at the extremes, lowest at the center. An ideal form of 
such a curve is shown in Figure 14. The fact that data assume 



this shape often indicates that they are heterogeneous, com- 
posed of two distinctly different types. For example, the 
mental test scores of honor-roll and failing pupils would, if 
thrown together, probably yield such a distribution. 

Still another variety of symmetrical distribution is the 
rectangular. In it the frequencies in the various classes are the 
same, so that the upper line bounding the graph is straight and 
horizontal and, with the ends and base, forms a rectangle. 

Nonsymmetrical curves 

Curves of which the two halves, right and left, are not alike 
are called asymmetrical, allokurtic, or sometimes skew. The 
latter term, also written skewed, is better reserved for those 
asymmetrical curves that may be thought of as formed by 
pulling a normal curve out in one direction and pushing it in 
on the other side. Such a curve is said to be skewed in the direc- 
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tion in which it extends farther from its highest point. There- 
fore, as shown in Figure 15, a curve extending out to the right 
and correspondingly bunched toward the left is positively 
skewed, and one extending farther to the left and bunched 
toward the right is negatively skewed. 

A curve that is extremely skewed, so that its highest point is 
at or near one extreme and its lowest at the other, is sometimes 
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NEGATIVE 


Fis. 1 5. Skew Curves 

called a J-curve, from its resemblance to that letter. It may 
be either positively or negatively skewed. 

Moderately skewed curves are frequently encountered in 
dealing with educational data, and extreme ones occasionally. 
Tests that are too hard or too easy for the average ability of 
those tested usually yield them, as also do various other situa- 
tions. J-curves are likely to result when some factor arbitrarily 
cuts off the measures at one end or the other. 

Curve fitting 

Especially in connection with finding the type of curve 
which best fits actual data, other varieties than those men- 
tioned are often employed. A number of schemes of grouping 
or classifying curves have been proposed. For example, Karl 
Pearson, the famous English statistician who for a quarter of 
a century before his death a few years ago was generally ranked 
as the world’s leader in this area, suggested a seven-fold 
classification. The seven were the straight line, the normal 
curve, the normal ogive, the parabola, the curve of organic growth, 
the curve of organic decay, and the Gompertz curve. Since the 
ordinary worker has little occasion to do much curve fitting 
and since the use of most of these involves more mathematics 
than 1;he writer wishes to make necessary to mastery of this 
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volume, only the straight line and the normal curve will be 
discussed in this volume. The former, as the line of regression, 
will be dealt with in Chapter VIII and the latter in Chap- 
ter XIII. 


EXERCISES AND PROBLEMS 

1. Draw a histogram for each of the following distributions* 


(a) 

(b) 

(c) 

(d) 

Per cent f 

Score f 

Score f 

Score/ 

95- 2 

750- 4 

260- 3 

28- 1 

90- 3 

700- 2 

250- 8 

26- 2 

85- 5 

650- 6 

240-15 

24- 0 

80- 6 

600-12 

230-29 

22- 3 

75- 8 

550-19 

220-46 

20- 6 

70-11 

500-26 

210-82 

18-13 

65-12 

450-17 

200-75 

16-16 

60- 7 

400- 5 

190-63 

14-14 

55- 9 

350- 0 

180-58 

12-11 

50- 4 

300- 2 

170-35 

10- 7 

45- 1 


160-16 

8- 5 

40- 1 


150-12 



140- 9 
130- 2 


2. Draw a frequency polygon for each of the distributions in 
Exercise 1. 

3. Draw a smooth frequency curve for each of the distributions 
in Exercise 1. 

4. Draw a smooth cumulative curve, cumulated from the bottom 
up, for each of the distributions in Exercise 1. 

6. Draw a smooth cumulative curve, cumulated from the top 
down, for each of the distributions in Exercise 1. 

6. Prepare a histogram to show individually each score in each 
distribution: 

(o) 78, 63, 85, 66, 91, 74, 80, 94, 86, 73, 79, 69, 84, 90, 75, 83, 88, 72, 
57, 85, 92, 65, 96, 84, 77, 76, 86, 81. 

(6) 38, 46, 27, 42, 44, 37, 47, 40, 32, 45, 36, 40, 41, 37, 46, 43, 35, 41, 
43, 29, 36, 43, 38, 45, 34, 39, 43, 46, 49, 37, 33, 41, 44, 32, 39. 

7. Smooth each of the distributions in Exercise 1 by threes. 

8. Draw the smoothed histogram for Exercise 1, Part (a). 

9. Draw the smoothed frequency polygons for Exercise 1, Parts 
(6) and (c). 

10. Draw the smoothed smooth frequency curve for Exercise 1, 
Part (d). 
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11. Draw an ogive for each of the distributions in Exercise 1. 

12. Insert the following percentile lines on the ogives drawn for 
Exercise 11: 

(а) Each tenth percentile. 

(б) The 5th, 10th, 25th, 50th, 75th, 90th, and 95th percen- 
tiles. 

(c) The 5th, 10th, 20th, 40th, 60th, 80th, 90th, and 95th 
percentiles. 

(d) The 10th, 30th, 50th, 70th, and 90th percentiles. 

13. Prepare an ogive representing the following two sets of scores 
and insert the 5th, 10th, 25th, 50th, 75th, 90th, and 95th percentile 
lines. 


Score f I fi 
140 - 1 

130 - 3 2 
120 - 6 5 
110 - 8 9 
100-12 13 
90-15 11 
80-10 8 
70-7 3 
60 - 4 0 
50-2 3 


14. Construct a normal curve with a maximum height of 4 inches 
and a base from +3 (t to — 3(r of 6 inches. 

16. Give two examples, not mentioned in the text, of educational 
data likely to form a curve of each of the following types: 


(а) Normal 

(б) Platykurtic 

(c) Leptokurtic 

(d) U-shaped 


(e) Rectangular 
(/) Positively skewed 
(gr) Negatively skewed 
(h) J-shaped 


16. State the type of curve which the graphic representation of 
each distribution in Exercises 1, 6, and 13 would most nearly ap- 
proach. 

17. Do the same as in Exercise 16 for the distributions in Exer- 
cises 2, 3, 4, and 5 on page 30. 
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CHAPTER IV 


Measures of Central Tendency 


Scope and function 

The one item of information that best describes or sum- 
marizes a frequency distribution is usually a statement of the 
scale value about which the cases composing it tend to cluster — 
in other words, a statement of its central point. Such a state- 
ment of a central point is called a measure of central tendency, or 
an average. Although some mathematicians and statisticians 
use the latter term as is common in elementary-school arith- 
metic and among laymen in daily life — that is, as the sum of a 
number of cases divided by their number — it is better usage to 
employ it in the more general sense stated in the previous 
sentence. Thus averages or measures of central tendency 
include means, medians, mid'-scores,lmodes, geometric means, 
harmonic means, and others. Of these, means and medians 
are most frequent in educational work. 

It will avoid possible confusion in some situations if readers 
will think of most averages as points on the scale, not as values 
of any of the actual cases. They may happen to coincide with 
them, but only in the case of the empirical or crude mode and 
of the mid-score of an odd number of cases are they truly scores. 

The Mean 

Mean, sometimes lengthened to arithmetic mean, is the statis- 
tical term for what is commonly called average. In other 
words, the mean, abbreviated M, is the sum of the items divided 
by their number. As such, it and the ordinary method of com- 
puting it are familiar to practically everyone who has completed 
elementary arithmetic. Its calculation, however, may fre- 
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quently be rendered much easier by the use of what is called 
the short method than it is by the procedure commonly 
employed. 

The short method of computing the mean requires that an 
assumed or estimated mean be taken, the sum of the differences 
of the data from it found, this sum divided by the number of 
cases, and the result added algebraically to the assumed mean. 
The result is the true mean. This method involves no loss of 
accuracy and has the advantage of markedly reducing the sizes 
of the numbers to be handled. To illustrate it, two examples 
are given below. The first compares the computation of the 
mean of a series of ungrouped data by the short method with 
its computation by the long or ordinary method; the second 
does the same for a grouped distribution. The advantage of 
the short method is greater in the second case than in the first. 

TABLE IV 


Computation of the Mean of an 
Unqrouped Series by Both Long and 
Short Methods 


By Long Method 

By Short Method 

Score 

Score d 

18 

18 

~22 

81 

81 

+41 

52 

52 

+12 

13 

13 

-27 

81 

81 

+41 

15 

15 

-25 

15 

15 

-25 

23 

23 

-17 

12 

12 

-28 

98 

98 

+58 

14 

14 

-26 

11 

11 

-29 

72 

72 

+32 

37 

37 

-3 

85 

W » 15 [6^ = 2X 

85 

+45 

+229 - 202 =» 

M - 41.8 

= 15 1 +27 = Xd 
c -|-1.8 

AbbM « 40. 

M =* 41.8 
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The first half of Table IV illiistrates the computation of the 
mean of 15 ungrouped scores by the long or typical elementary- 
school method. The formula is 


S, the capital Greek sigma, being the symbol for summation^ 
and X that for any score. The scores are added, and their sum, 
627, divided by 15, their number, gives 41.8, their mean. 

The second half of the table contains the same 15 cases.* By 
inspection a round number that appears to be close to the mean 
is chosen. It is called the assumed mean or guessed mean. No 
error is caused if it is not very close to the true mean, but the 
figures to be dealt with are smaller the closer it is. In this 
instance, 40 appears to be a good value to take as the assumed 
mean, AssM. Sometimes the smallest number in the series, or 
a “round” number slightly smaller than it, is taken as the 
assumed mean. So doing avoids negative differences. In this 
case, the value of the smallest case, 11, is so low in comparison 
with the largest ones that such an assumed mean would not be 
worth taking. In the column headed d, for difference or 
deviation, the remainders found by subtracting the assumed 
mean from each score are entered, with proper sign. Thus 
18 — 40 = —22, the first entry in that column; 81 — 40 = 
-(-41, so this is the second entry; the others are obtained 
similarly. It is not necessary, but likely to avoid confusion, 
to place the positive deviations in one column and the negative 
ones in another, as here. The d column is then added. In the 
example, the sum of the positive deviations is -(-229; that of the 
negative ones, —202; hence that of all is -(-27. This is 
labelled Sd, read “the sum of the deviations.” It is divided by 
N, here 15, and the result, -(-1.8, is the correction, abbreviated c, 
which is the amount that the assumed mean is in error. The 
correction is then added to the assumed mean to give the true 


* Sometimes S rather than S is used as the symbol of summation. 

* It is more convenient to have them in order of size, but the advantage gained 
from such arrangement is scarcely sufficient to justify the trouble of rearranging 
them if they are already in some other order. 
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mean, here 41.8, the same as found by the other method. The 
formula for this method may be written 

M = AssM + c or AssM + 

Table V presents the two methods for a grouped distribution. 


TABLE V 

Computation of the Mean of a Grouped Distribution 
BY Both Long and Short Methods 


By Long Method 


By Short Method 


Score f 

midp 

/ X midp 

380- 1 

390 

390 

360- 2 

370 

740 

340- 1 

350 

350 

320- 2 

330 

660 

300- 4 

310 

1240 

280- 0 

290 

0 

260- 4 

270 

1080 

240- 9 

250 

2250 

220- 9 

230 

2070 

200- 5 

210 

1050 

180- 7 

190 

1330 

== 44 


44 1 11160 


M = 253.64 


Score f 

d 

fd 

380- 1 

+7 

+ 7 

360- 2 

+6 

+12 

340- 1 

+5 

+ 5 

320- 2 

+4 

+ s 

300- 4 

+3 

+12 

280- 0 

+2 

0 

260- 4 

+ 1 

+ 4 

240- 9 

0 

+48 

220- 9 

-1 

- 9 

200- 5 

-2 

-10 

180- 7 

-3 

-21 

11 


-40 


44 I + 8 = S/d 


c = + .182 


i = 20. 


CT = + 3.64 
AssM = 250. 


M = 253.04 


In it the same data employed in previous chapters are utilized. 
In the long method, at the left, the mid-point of each class is 
found, multiplied by its frequency, the sum of these products 
found, and this divided by N to give the mean. For example, 
the mid-point of the top class is 390, its frequency 1, hence the 
product is 390; the mid-point of the next class is 370, its fre- 
quency 2, the product 740; and so on. The sum of the prod- 
ucts, 11,160, is divided by 44 to give 253.64, the mean. 

The second half of the table contains the computation by 
the short method, using the formula, 

M = AssM 4- ci or AssM -f ^ i. 
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Its first two columns are the same as those in the first half. 
As in the ungrouped series, an assumed mean is taken. It 
must be at the mid-point of some class. Here it is at 250, the 
mid-point of the 240-class. This class then has 0 entered after 
it in the d column. Above it are positive entries and below it 
negative ones. In each direction they begin with 1 for the 
class next to that containing the assumed mean and increase 1 
for each successive class. In other words, they show how many 
intervals above or below, that is, larger or smaller than, the 
assumed mean class each of the other classes is. In the example, 
the highest class is 7 intervals above, the lowest 3 below. The 
last or fd column contains the product of each d entry by the / 
for the same class. Since this product is always 0 for the class 
wherein the assumed mean lies, it is customary to enter the 
sum of the positive fd’s, here 4-48, in that space. The sum 
of the negative /d’s is —40; hence the sum of all is -1-8. Divid- 
ing this by N, 44, gives a correction of -|-•182, expressed in 
intervals. This is multiplied by20, the interval, and the result, 
+3.64, added to the assumed mean, 250., gives the mean, 
253.64. 

The median 

The median is the point on the scale on each side of which 
half the cases lie. It is not so common in ordinary life as the 
mean, but because of its usually greater ease of computation 
and sometimes better representation of the whole distribution 
it has come to be employed quite often in educational work. 
Probably its most frequent use is’in reporting test scores. 

The computation of the median, abbreviated Md, Med, or 
sometimes Mdn, is illustrated in Table VI for the same data 
just employed for the mean. The formula is 

Md = l+ —jr— i. 

Since I, N, /, and i have been used before, S is the only new 
symbol. It stands for 'partial sum, which will be explained 
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shortly. In using this formula, one should start with the first 

N 44 

term in the numerator, Here this is which equals 22. 

TABLE VI 

Computation op the Median 


Score f 


380- 1 


360- 2 


340- 1 


320- 2 

44 

300- 4 

-21 

280- 0 

Md = 240 + 20 

260- 4 

9 

240- 9 

= 242.22 

220- 9 


200- 5 


180- 7 


N = 44: 



Next the worker should obtain the partial sum, <S, by adding 
the frequencies from the bottom up until they give the largest 

N 

possible sum not greater than-^ • In this case 7 + 5 + 9 = 21, 

which is S. The denominator of the fraction, /, is the frequency 
of the next class above the highest one of which the frequency 
was used in getting S. In this instance, therefore, it is 9. The 
lower limit of the same class, here 240, is the I of the formula. 
Employing these values, also i as 20, the formula gives 

44 

— - 21 

Md = 240 + — 20 = 242.22. 


The median may also be found by working down from the 
top as well as up from the bottom, but the writer does not 
recommend this except as a check on the other method, for 
which purpose it is useful. The formula for it is 


Md = w 


N 

2 


/ 


S 


i. 


This gives, for the same example. 
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Md = 260 - 



9 


20 = 242 . 22 , 


as before. 

There are cases in which the computation of the median 
presents certain unusual features. Although the same formula 
fits such instances, it seems well to illustrate some of them. 
This has been done in Table VII. Part A presents the most 

TABLE VII 

Computation of the Median in Cases Presenting 
Certain Unusual Features 


Part A 

' Part B 

Part C ! 

Part D 

1 Part E 


N 

S = -and 

next / - 0 



/ of lowest 

II 

CO 

5 = — and 

next two or 
more/’s = 0 

i is not 
uniform 

class 

exceeds 

Score f 

Score f 

Salary f 

Pupils f 

Score f 

50- 2 

28- 1 

1600- 1 

600- 1 

180- 1 

45- 3 

26- 6 

1550- 3 

500- 4 

160- 1 

40- 6 

24-14 

1500- 5 

400- 3 

140- 2 

35- 7 

22- 8 

1450- 7 

300- 6 

120- 0 

30-12 

20- 0 

1400- 2 

200- 8 

100- 2 

25-15 

18- 4 

1350- 0 

100-11 

80- 1 

20-17 

16- 7 

1300- 0 

75- 9 

60- 3 

15-11 

14-10 

1250- 0 

50-13 

40- 5 

10- 6 

12- 5 

1200- 4 

25-18 

20-12 

5- 8 

10- 3 

1150- 6 

0-14 

0-38 

0- 3 

100 

II 

1100- 5 

N = S7 

N = 65 

II 

8 

^ on 

1050- 2 

1000- 1 

Md = 

Md = 


^=29 

CO 

CO 

II 

87 

65 ^ 

— = 45 

= 29 


32 

-:r~0 

2 


2 


S = 45 
Md = 25. 

Md = 20 


50+ -2o 

38 


^ 2 1 

S = IS 

= 72.12 

= 17.11 


= 20 -f- 

Si 


2 

Md = 1250 + 




= 21. 






— 1Z50 + 2 





- 1325. 




frequent of these cases, that in which S is just equal to and 
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there is no zero frequency in the next class. The formula can 
be applied in full; thus, 


Md = 25 + 



= 25, 


but it is unnecessary to do so. Instead, as soon as it is evident 
N 

that /S = , one may at once take the lower limit, here 25, of 

the next class above the highest one added in getting S, as the 
median. 

N 

Parts B and C present examples in which S = -^ and the next 

class or classes above the highest one used in getting S have 
zero frequencies. In such cases the median is equal to the 
lower limit of the next class above the highest one added plus 
one-half of the distance covered by the zero frequencies. Thus 
in Part B the next lower limit is 20 and one frequency of 0 

2 

occurs. Since the interval is 2, the median equals 20-1-2 =21. 

In Part C there are three frequencies of 0, with an interval of 50; 
3 X 50 

hence — 2 — ~ should be added to 1,250, the next lower 


limit, to give a median of 1,325. 

In Part D the interval is not uniform. The only special 
point to be watched here is that, for i in the formula, the value 
of the interval within which the median falls be substituted. 
Hence there, 

^-32 

Md = 50 + — 25 = 72.12. 


Part E differs from most cases in that the frequency of the 

N 

lowest class is greater than ■ In such cases / of the formula 

is the lower limit of the bottom class and S is 0. In the 
example, 

Md^0 + 20 = 17.11. 
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The mid'score or mid-measure 


Many workers in this field employ the term~median regard- 
less of whether the data in question compose a simple or a 
grouped distribution. Others, however, prefer to limit its use 
to the latter case. In other words, they employ the term only 
for a measure obtained from a grouped series by the use of the 
formula employed in the preceding section and use mid-score 
or mid-meamre, instead of median, for a simple or ungrouped 
series. This practice appears to be preferable. 

The mid-score or mid-measure may be simply defined as 
the middle one of a series of scores or other measures arranged 


in Older of size. 


If the total number is odd, it is the 


N -b 1 
2 


th 


case from either end; if the total number is even, it is the 

N 

mean of the two mid-most cases, that is, of the th and the 


/N \ 

( + 1 jth cases. For example, if 17 scores are arranged in 

17 + 1 

order of size, the mid-score is the — 2 — 9th; if 25 


cases, the 13th; if 28 cases, the mean of the 14th and 15th; 
and so on. In dealing with pupils’ papers it is frequently con- 
venient to find it by arranging the papers in order and counting 
in to the middle one without making a list of the scores. 

Since the mid-score is found for an ungrouped distribution 
and the median for a grouped one, the mid-score is usually 
found for a distribution containing few cases — perhaps not 
over 30 or 40 — and the median for one with more. 


The mode 

The mode, abbreviated Mo, is, as its name implies, that which 
is most popular or frequent. In other words, it is the point on 
the scale at which there are most cases. Therefore the mode 
of a graphed distribution is the point or scale value directly 
below the highest point of the curve. 

The mode obtained by mere inspection, as just suggested, is 
called the crude, empirical, apparent, or inspectional mode. It 
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is almost instantaneously obtainable and frequently serves as 
a good typical value of the thing being measured. 

In the case of tabulated distributions the crude mode cannot 
be exactly determined. It is customary either to state that it 
falls in a specified class or that it is the mid-point of 
that class. Thus the crude mode of the distribution 
given at the right would be said to be in the 50-class, 
or at 65. The former is preferable. 

The true mode is the point at which the greatest 
frequency would occur if there were no grouping of 
measures, if all were perfectly reliable, and if no 
sampling were involved. It cannot actually be deter- 
mined, but an approximation to it, known as the 
estimated, computed, or refined mode, is often found. 

Various formulas for this purpose have been suggested. Two 
of those cojnmonly used will be given here. More exact ones 
exist, but they are longer and more difficult. 

The most frequently used formula is Mo = 3Md — 2M. For 
distributions not very far from normal in their shape, this 
generally 5delds a close approximation to the true mode. For 
the distribution just given, 

Md = 51.54 and M = 52.33; 

therefore, 

Mo = SX 51.54 - 2 X 52.33 = 49.96. 

For the distribution of 44 cases so often used. 

Mo = 3 X 242.22 - 2 X 253.64 = 219.38. 

Since, however, this distribution is not nearly normal, this 
value is probably not a close approximation to the true mode. 
The reader will notice that this formula yields a value for the 
mode that is on the same side of the mean as the median, but 
three times as far away. Sometimes, indeed, the formula is 
written 

Mo = M - 3(M - Md), 

but it is simpler to multiply, remove the parentheses, and 
combine into the form given above. 


Score f 
90- 2 
80- 4 
70- 5 
60- 8 
50-13 
40-11 
30- 9 
20- 7 
10-_1 
AT = 60 
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For quite asymmetrical distributions, the second formula 
for the mode is generally preferable to that given above. It is 

Mo = Z + 

Ja Jb 

in which I is the lower limit of the modal class, the subscript a 
refers to the class just above the modal class and b to that just 
below it. For the distribution on the previous page this gives 

Mo = 60 + g = 54.21. 

In applying this formula to the distribution of 44 cases there 
is the difficulty that, since two classes each have the largest 
frequency, there is no one modal class. In such a case, prob- 
ably the best procedure is simply to take the point that divides 
the two classes as the mode. Here it is 240. If the two classes 
with the highest frequency are not adjacent, this cannot be 
followed, nor can it if there are more than two classes with 
the same largest frequency. In such cases a single mode for 
the distribution has so little meaning that it is scarcely worth 
determining. 

In cases with two or more tied frequencies as the largest, or 
even with two or more unequal frequencies greater than the 
adjacent ones, a distribution or curve may be referred to as 
bimodal if there are two, multimodal if there are more than two. 
Thus the much-used distribution of 44 cases is multimodal, 
but with two major modes and several minor modes. The one 
employed only in this section, with a single high point and no 
other frequency greater than both adjacent ones, is unimodal. 

The geometric mean 

The geometric mean is the iVth root of the product of N 
measures. In formula form, 

G, GM, or Ma = Vx^-Xi-X^- .... Xs. 

It is frequently employed in economics and elsewhere in con- 
nection with index numbers, but its only use in education that 
see^ worth mentioning here is to determine rates of increase 
of population, enrollment, and so forth. 
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Its use for the purpose just mentioned ordinarily occurs when 
predicted increase is desired and it is assumed that the rate of 
increase will remain the same as it has been for the period 
covered by the data at hand. Since this assumption is often 
not valid, the predictive value of the geometric mean is quite 
limited. All the data necessary to find it are the figures on 
whatever is involved at the beginning and end of a known 
period. In such situations a variation of the formula given 
above is most convenient. It is 

G = 

in which N is the number of time units, ft, is the frequency or 
number at the end of N time units, and/o that at the beginning. 

The application of the formula just given may be illustrated 
by assuming that the annual rate of increase in population is 
desired for a community which had 2,872 inhabitants in 1930 
and 3,628 in 1940. Since there are ten years involved, N = 10, 
and the formula gives 

G = = 1.0236.* 

This means that, on the average, each year’s population during 
1930-1940 was 1.0236 times that of the year before. In other 
words, the average annual increase was .0236 or 2.36 per cent. 

As a second example, we may suppose that a pupil was 45 
inches tall in September and 49 inches tall in June, 9 months 
later, and his average monthly rate of increase is desired. 

G = ^ ^1.0889 = 1.0086, 

* The value of 1.2632 was found by the use of logarithms. Without knowl- 
edge as to how to use them, readers will find it very difficult to secure geometric 
means except when N equals some power or product of 2 and 3. In such cases 
a succession of square and cube roots, as needed, can be extracted. For example, 
the fourth root is the square root of the square root; the sixth root is the square 
root of the cube root; and so on. Since this is the only procedure dealt with in 
this volume where the use of logarithms is likely to be necessary, and it does not 
actually occur very often in educational practice, their nature and use will not be 
explained here. 
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whence the average monthly rate of increase is .0086 or .86 per 
cent. 

The same procedure may be applied to rates of decrease as 
well as to those of increase. In such instances G comes out 
less than 1.00 and the difference is the rate of decrease. 


The harmonic mean 


It appears in place to mention this measure, but not to dis- 
cuss it at length. The harmonic mean is the reciprocal of the 
mean of the reciprocals of the measures. In formula form, 


II, IIM, or Mh 



It is rarely employed. For data involving both time and 
work units, both it and the arithmetic mean are appropriate 
but not directly comparable, since one is based on one unit, 
the other on the other unit. If one wishes an average time rate 
and is employing work units, he must use either the arithmetic 
mean of the absolute times or the harmonic mean of the time 
rates; if he is employing time units, he must use either the 
arithmetic mean of the time rates or the harmonic mean of the 
absolute times. 


The use of averages 

Mathematical statisticians enumerate six or more criteria of 
a good average and discuss the various measures of central 
tendency in terms thereof. For present purposes a less 
thorough treatment seems sufficient. 

The mean is, in most instances, the most reliable average. 
As distributions become more and more skew, however, it loses 
this advantage until for very skew ones it may be less reliable 
than the median, or even than the mode. It is likely to be 
more affected by one or a few large errors than is either the 
median or the mode. Moreover, the more heterogeneous the 
group of data, the less satisfactory is the mean as a representa- 
tive measure. Its computation is more difficult than that of 
the median, but its determination is a necessary step in com- 
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puting several other measures that may be wanted, whereas 
that of the median rarely is. 

The median has come to be almost universally employed as 
a measure of central tendency for distributions of test scores 
and many other kinds of data. It has the advantage of being 
consistent in type with the other point measures to be presented 
in the next chapter. If signs are neglected, the sum of devia- 
tions of scores from it is a minimum. When exact values of 
extreme cases are unknown it can be found, whereas the mean 
cannot. Also, when the units of measurement employed are 
not known to be equal, it is preferable. It does not assume 
such equality, whereas the mean does. 

The mode is rarely employed in educational work except as 
a quick, “rough-and-ready” measure. In some instances, 
especially where the greatest frequency constitutes a large 
fraction of the total number of cases, the mode is the most 
typical and representative measure of central tendency. For 
example, if a high-school principal receives $2,700 and each of 
eight teachers $1,620, the mode, $1,620, is for most purposes 
more representative of the typical salary in that high school 
than is the mean, $1,740. The mid-score, also $1,620, is of 
course equally typical. If, however, the figures were to be 
employed in such a connection that the total amount of salary 
would be wanted, the mean would be more useful, since the 
total could be determined from it but not from either of the 
other two. 
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EXERCISES AND PROBLEMS 


1. Find the mean, by the short method, of each of the following 

distributions: 






(a) 

(6) 

(c) 

(d) 

(e) 


Age f 

Score f 

Score f 

Score f 

Words f 

20-0- 2 

66- 4 

120- 2 

900- 6 

48- 

3 

19-6- 1 

63- 5 

110- 3 

850- 3 

46- 

4 

19-0- 3 

60- 3 

100- 6 

800- 7 

44- 

7 

18-6- 8 

57- 7 

90-11 

750- 8 

42- 

3 

18-0-11 

54-11 

80-18 

700- 4 

40- 

6 

17-6- 9 

51-19 

70-25 

650-10 

38- 

2 

17-0-12 

48-12 

60-17 

600- 8 

36- 

2 

16-6-13 

45-13 

50-10 

550- 1 

34- 

0 

16-0-15 

42- 6 

40- 5 

500- 3 

32- 

1 

15-6-14 

39- 2 

30- 2 

450- 2 

30- 

0 

15-0-17 

36- 4 

20- 2 

400- 1 

28- 

0 

14-6-15 

33- 1 

10- 0 


26- 

1 

14-0- 9 


0- 3 


24- 

0 

13-6- 4 




22- 

1 

13-0- 1 


♦ 


20- 

1 

2. Find the median of each of the distributions in 

Exercise 1 


3. Find the median of each of the following distributions: 


(o) 

(6) 

(c) 

W) 

(e) 


Score f 

Score f 

Score f 

Score f 

Pupils f 


260- 2 

52- 3 

15-3 

250- 1 

3500- 1 


240- 3 

50- 5 

14-4 

225- 3 

3000- 0 


220- 6 

48- 6 

13-2 

200- 1 

2500- 2 


200- 8 

46-11 

12-6 

175- 4 

2000- 3 


180-13 

44- 7 

11-1 

150- 2 

1500- 4 


160-16 

42 - 9 

10-3 

125- 6 

1000- 7 


140-10 

40- 0 

9-0 

100- 7 

500-22 


120- 9 

38- 2 

8-0 

75- 9 

400-13 


100- 9 

36- 8 

7-1 

50-16 

300-17 


80- 8 

34-18 

6-5 

25-25 

200-33 


60- 5 

32- 6 

5-4 

0-34 

100-54 


40- 4 

SO’ 7 

4-3 


0-68 



20- 2 3-4 

0 - 1 2-2 

1-3 


4. Find the mode, by the formula based on the mean and the 
median, for each of the distributions in Exercise 1. 

6 . Find the mode, by the formula based on the frequencies above 
and below the modal class, for each of the distributions in Exercise 1. 
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6 . Find the mid-score or mid-measure of each of the following sets 
of data: 

(а) 25, 3, 15, 1, 9, 18, 5, 16, 17, 11. 27, 16, 23, 19, 8, 9, 11, 23, 6, 9, 28, 19, 
18, 22. 

(б) 74, 67, 89, 75, 81, 90, 78, 85, 72, 95, 88, 61, 80, 76, 72, 93, 87, 85, 74, 
77, 82. 

(c) 104, 115, 94, 97, 109, 91, 122, 86, 103, 98, 93, 111, 101, 82, 99, 105, 97, 
106, 89, 114, 108, 113, 92, 96, 88, 95, 100, 106, 99, 92. 

7. If a school enrolled 225 pupils in 1935 and 440 in 1945, what 
was the average annual rate of increase? 

8. If a pupil weighs 92 pounds on September 1 and 106 pounds 
on May 1, what is his average monthly rate of increase? 

9. If the per cent of failure is reduced from 12. to 7.4 in four years, 
what is the average annual decrease? 
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CHAPTER V 


Other Point Measures 


General definition and function 

As was suggested in the discussion of ogives in Chapter III, 
it is frequently useful to indicate other points in a distribution 
in addition to averages. Such points are based upon the divi- 
sion of the total number of cases in a distribution into an equal 
number of parts. The points which so divide it are denoted 
by terms formed by adding the suffix -iZe to a stem which de- 
notes into how many parts the* distribution is divided. The 
single exception to this is that, as already stated in Chapter IV, 
the measure that divides it into two equal parts is called the 
median. Since to divide a distribution into any number of parts 
lequires one less division point than there are parts^ one 
inedian divides it into two parts, two tertiles into three parts, 
three quartiles into four parts, four quintiles into five parts, and 
so on. In other words, there is always one less point of any 
kind than its name suggests, that is, nine deciles, ninety-nine 
percentiles, and so on. 

There is no limit to the number of such points possible, but 
in practice only a few are employed. Most common are the 
quartiles; next are the percentiles; others are relatively rare 
but occasionally useful. 

Each set of point measures is numbered from first up to the 
highest. Each measure so numbered indicates the fraction of 
cases below the point indicated. Thus one-fourth of the cases 
in a distribution are below its first quartile; two-fourths are 
below its second quartile, which is the same as its median; 
three-fourths are below its third or upper quartile. Similarly, 
to ^ve several examples, at random, one-tenth of the cases 
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lie below the first decile; seven per cent lie below the seventh 
percentile; two-thirds lie below the second tertile; ninety- 
eight per cent lie below the ninety-eighth percentile; and so 
on. By subtracting the fraction below any point from unity, 
the percent above it is easily found. Thus three-fourths are 
above the first quartile; one-tenth are above the ninth decile; 
two per cent are above the ninety-eighth percentile; and so 
forth. 

It is frequently possible to call a point measure by any one 
of several names. Thus the median is the same as the second 
quartile, the third sextile, the fifth decile, the fiftieth percentile, 
and all other halfway measures. Likewise the first quartile 
and the twenty-fifth percentile are the same; the first decile 
and the tenth percentile; the third quintile, the sixth decile, 
and the sixtieth percentile; and so on. 

These measures are regularly abbreviated by their initial 
letters or syllables with appropriate numerical subscripts. Thus 
Qi is the symbol for the first quartile, Qs for the third quartile, 
Quinti for the first quintile. Pi or Pen for the fifth percentile, 
and similarly for others. Readers should note that, although 
the usage is sometimes employed by those unfamiliar with 
correct terminology in this connection, the terms ending in -ile 
are not correctly applied to the parts into which a distribution 
is divided. For example, the part below the first quartile 
should be called the first quarter or fourth; the part between 
it and the second quartile is the second quarter or fourth; the 
part below the first percentile is the first per cent or hundredth; 
the part between the thirty-third and the thirty-fourth per- 
centiles is the thirty-fourth per cent or hundredth; and 
likewise for others. 

Computation of quartiles, percentiles, and similar measures 

The method of computation of quartiles, percentiles, and 
similar measures is similar to that for finding the median. The 
only difference is that the first term in the numerator, instead of 
N 

being is whatever fraction of N lies below the measure to be 
computed. Therefore the formula for the first quartile 
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Qi — I A j — i’, 


that for the third quartile is 


SN 


S 


Qs = i + 


that for the fifth percentile is 




- S 




and similarly for others. 

Table VIII presents the computation of the first and third 

TABLE VIII 

Computation of Quartilks and Pkrckntiles 


Score f 
380- 1 
360- 2 
340- 1 
320- 2 
300- 4 
280- 0 
260- 4 
240- 9 
220- 9 
200 - 5 
180-_7 
V = 44 


7 -' 

= 200 4- -^-7 — 20 = 216.00 
5 


3 X 44 


- 30 


Qz = 260 + 


Ps = 180 + 


Pio = 180 -f 


Pto = 260 -h 


4 

5 X 44 
100 


20 = 275.00 


- 0 


7 

10 X 44 


- 20 = 186.29 


100 


- 0 


7 

70 X 44 


20 = 192.57 


100 


30 


P 90 = 320 4“ 


4 

90_X_44 

100 


20 = 264.00 


- 38 


20 = 336.00 


quartiles and four percentiles for the series of data previously 
employed. The first step in determining the first quartile is to 
take one-fourth of N, here 44, which in this case is 11. S is 7, 
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since adding the next frequency, 5, would give a sum larger 
than 11. The frequency of the next class is, as just stated, 5; 
its lower limi t is 200; and i is 20. Therefore, 

Qi = 200 + — 20 = 216.00 

The others are found in like fashion, the first term in the 
numerator of each being the fraction of N indicated by the 
name of the measure being found. 

Overlapping 

Sometimes we wish to compare distributions by determining 
how much they overlap. To do so, we find what fraction of 
the cases in one overlap or extend beyond some point in the 
other. Overlapping may be computed in either direction, 
above or below, and may be past any point. The point most 
commonly employed for this purpose is the median; next to it 
are the quartiles. The fraction is usually expressed in per cent. 

The data at the side may be employed to illustrate the 
determination of overlapping. They represent the mental 
ages of pupils in a fifth and a sixth grade in the 
same school. If we desire to find out what 
periCent of those in the fifth grade exceed the 
median of the sixth, the first step is to find this 
median. It is 

^-3 

11 + 1 = llVe. 

Those of the fifth-grade pupils with mental ages 
above this are the 2 in the 13-class, the 3 in the 12-class, and 
of the 11 in the 11-class. This fraction is taken because 
of the interval is above ll®/®, the point past which overlapping 
is to be measured. 

2 -f 3 4- y = eVe, 

the number of fifth graders above the sixth-grade median. 
This divided by 40 gives about 17 per cent. 


MA V VI 

14- 1 

13- 2 4 

12- 3 10 

11- 11 18 

10- 16 2 

9- 6 1 

8 - _2 _ 

iV = 40 36 
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For a second example, we may assume that the per cent of 
sixth graders below the first quartile of the fifth is wanted. 


0i = 10 + 



16 


1 = 10.125. 


The number of sixth graders below 10.125 is 1 + .125 X 2 = 
1.25, which is approximately per cent of 36. 

Another, and rarer, method of measuring overlapping is to 
take the difference between similar measures of central tendency 
of the two distributions. For the data above, 

M(h = 10.75 and M(k = 11.83, 

hence the difference is 1.08. To render this relative rather than 
absolute, it is divided by an appropriate measure of variability 
of the distribution the overlapping of which is being found. 
Thus, if that of the sixth on the fifth is desired, 1.08 may be 
divided by the quartile deviation of the sixth-grade scores, the 
computation of which is presented in the next chapter, .83, the 
result being 1.30. 

Bi-serial r, the meaning and calculation of which will be given 
in Chapter XII, may also be employed to measure overlapping. 
It appears to merit more use for this purpose than it has 
received. The greater its value, the less the overlapping. 


Finding percentile ranks of scores in grouped series 

There are two somewhat different situations in which per- 
centile ranks of scores in grouped distributions may be desired. 
Both will be explained. One is that in which a percentile rank 
is to be assigned to all the cases in a class, considering them as 
being located at its mid-point. This is very easily done. All 
that is necessary is to find the siun of the frequencies in all 
classes below the one in question plus half of its frequency, 
divide this by the total number of cases, and multiply by 100. 
In formula form, this is 


100 
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For example, the percentile rank of the mid-point of the 220- 
class in Table VIII is 


100 


7 + 5 + 1 
44 


1650 

44 


37.5: 


that of the cases in the 300-class is 


100 


44 


3600 

44 


81.82; 


and similarly for others. 

The other situation is that in which the percentile rank of a 
given score is wanted. The general procedure is the same as 
in the other case, the only difference being in the last term of 
the nmnerator. In this instance this term is found by multi- 
plying the frequency of the class wherein the score lies by a 
fraction the numerator of which is the difference between the 
score and the lower limit and the denominator of which is the 
class interval. Thus the fraction represents the proportion 
of the cases in the class that lie below the score in question. 
To illustrate the procedure, a score of 217 may be used. Its 
percentile rank is 


100 


7 + 


217 - 200 , 
^ ^ 
44 


1125 

44 


25.57. 


As another example, the percentile rank or PR of 365 is 


7 + 5 + 9 + 94.4 + 0 + 4 + 2+ 1 I 365-360 g 


100 


20 


44 


The formula for this may be written 


4150 

44 


= 94.32. 


PR = 100 




N 


in which X represents the score to be expressed as a percentile. 
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Changing ranks into percentiles 

It is often convenient to compare or combine the ranks of 
an individual in several groups, or perhaps those of different 
individuals in different groups. If the numbers of cases in 
the groups are the same, such comparisons or combinations can 
be made directly and simply by the use of ordinary ranks. 
For example, if a pupil is third in a group of 35 and seventh in 
another group of 35, it is at once evident that he is 4 ranks 
better in one than in the other and that his average rank in the 
two is fifth. If, however, the numbers of cases in the groups 
concerned differ, it is not readily apparent how much his rank 
in one differs from his rank in another, nor what his average 
rank is. 

To make comparisons and combinations in such cases all 
ranks must be changed to the same basis. This may be any 
system of dividing a distribution into parts with equal numbers 
of cases, such as by quartiles, deciles, and so on. In practice, 
percentile ranks or scores are almost always used. The general 
formula for changing ordinary rank* into any other rank is 

«-2 

XR ~X, 

which becomes 

2R - 1 y 
2N ^ 
or 

XR-§ 

N 


R is the symbol for rank, when used with X referring to the 
new rank and when alone to the ordinary rank with 1 as lowest 
and N as highest, and X is the order of the new rank. For 
percentile rank the formula becomes 


" 2 2fl - 1 lOOR - 50 

PR = — 100 or 100 or ^ 


‘ leaders who have diflSculty in assigning ordinary ranks should consult the 
section thereon near the first part of Chapter X. 
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The use of this formula may be illustrated by the case of a 
pupil who stands twelfth from the bottom of an algebra class 
of 28 members, tenth from the bottom of a Latin class of 27, 
sixth from the bottom of an English class of 30, and eighteenth 
from the bottom of a history class of 24. His best rank is 
evidently in history and his worst in English, but neither the 
differences among his ranks nor his average rank can be deter- 
mined directly from the facts given. His percentile ranks may 
be found as follows; 


PRam — 

PRsog = 


100 X 12 - 50 
28 

100 X 6 - 50 
30 


= 41.07 


= 18.33 


PRim — 
Pfintot = 


100 X 10 - 50 
27 

100 X 18 - 50 
24 


35.19 
= 72.92. 


With these percentile ranks available, the average rank of this 
pupil can readily be found by averaging them, as follows: 

41.07 + 35.19 + 18.33 + 72.92 __ 


It is sometimes desirable to be able to change percentile or 
other similar ranks into ordinary ranks. The general formula is 


N-XR 

X 


N-XR + 


+ 2or = 


For changing percentile ranks into ordinary ranks it becomes 


^ N -PR 
100 


, 1 

+ 2 or 


N-PR + 50 
100 


For example, if a pupil has a percentile rank of 15 in a group 
of 30, his ordinary rank is 

30 X 15 + 50 . 

100 ® 

from the bottom. 

Since a percentile or any other rank of the -die type indicates 
the fraction of cases below it, there can be no actual case with 
a rank of either 0 or the whole number of divisions formed by 
the -die points. A 0 rank would indicate a rank above no cases 
and below all; the latter one, above all and below none. A 
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score cannot, however, be either below or above itself; hence 
the lowest rank is that which indicates a position below all the 
other, or N-1, scores and the highest rank that which indicates 
a position above all the other, or N-1 , scores. To dispose of the 
score for which the rank is being sought, it is considered half 
above and half below itself. In other words, the assumed 
distribution of cases is the same as that of cases in a class 
of a grouped distribution, as explained on p. 25. Therefore 

the lowest possible -ile rank of a case is ^ and the highest 
X — For percentiles these become 

50 j iaa 50 
and 100 — -j^- 
N N 


Therefore the percentile rank of the lowest of 10 cases is 

100 50 ^ 

2N 10 “ 

and that of the highest is 

100 - ^ = 95 ; 

that of the lowest of 29 cases is 

and that of the highest 

50 

100 - g = 98.28; 
and likewise with others. 


EXERCISES AND PROBLEMS 

1. Compute the first and third quartiles, the seventh, fifteenth, 
fortieth, and ninetieth percentiles, of the data in each part of Exer- 
cise 1 on page 69. 

2. Compute the first and third quartiles, the fifth, tenth, and 
ninety-eighth percentiles, of the data in each part of Exercise 3 on 
page 69. 

8. Determine the per cent of overlapping of each distribution 
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above the median and also below the first quartile of the distribution 
paired with it: 


(a) (b) 


IQ Fast 

Aver. 

Score XI 

XII 

140- 2 


115- 

2 

130- 4 


110- 1 

4 

120- 18 

1 

105- 3 

6 

no- 14 

5 

100- 7 

12 

100- 

28 

95-19 

11 

90- 

26 

90-24 

9 

80- 

4 

85-16 

3 



80- 8 

1 



75- 3 

0 



70- 2 

1 


4. Find the percentile rank of the mid-point of each class in each 
part of Exercises 1 and 3 on page 69. 

6. Find the percentile rank of the following scores in the indicated 
parts of Exercise 1 on page 69: 

(а) 13-10, 14-9, 16-2, 18-0, 19-1. 

(б) 37, 42, 50, 58, 68. 

(c) 47, 69, 76, 95, 114. 

6. Find the average percentile rank of a pupil whose ordinary 
ranks from the bottom up are as follows: seventh among 25, third 
among 22, eleventh among 26, eighth among 19, fourteenth among 32. 

7. How many percentile points is each of the following pupils 
above or below each of the other two? 

(а) John, sixteenth in a group of 42. 

(б) James, ninth in a group of 27. 

(c) Joseph, tenth in a group of 30. 

8. If a pupil has a percentile rank of 61.36 in a group of 22 and 
one of 30 in a group of 35, what is his ordinary rank in each instance? 

9. What are the percentile ranks of the highest and lowest of each 
of the following numbers of cases? (a) 32; (6) 6; (c) 250. 
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CHAPTER VI 


Measures of Variability 


Function and scope 

Although the one most significant and descriptive item of 
information about a distribution is generally a measure of its 
central tendency, this alone is quite inadequate to portray it. 
Two sets of data may have the same average but differ greatly 
in how widely they spread about that point. This is illustrated 
by Figure 16 and the accompanying distributions, which show 



the distributions of intelligence quotients of two groups of 
pupils. Both average 100, but group A is a compact, homo- 
geneous group of normal and near-normal pupils, whereas 
group B includes a range from high-class morons up to quite 
superior children. This difference would not be revealed by 
comparison of their averages. 
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There are other descriptive facts that might be given for any 
distribution, but usually the next most significant to a measure 
of central tendency is one of variability, also called variation, 
dispersion, deviation, departure, discrepancy, spread, scatter, 
and fluctuation. There are a number of measures of this char- 
acteristic. Of these the range, the quartile deviation, the 
tO-90 percentile range, the mean deviation, the standard deviation, 
and the median deviation will be presented in this chapter. 

The measures of variability just named are absolute numer- 
ical expressions, found in terms of the scoring system used for 
whatever is being measured, that express the spread or scatter 
of the data in terms of base-line units. As units of distance 
they may be employed to measure and state distance from one 
case to another, from a case to an average, from a case to a 
quartile, percentile, or other similar point, indeed from any 
point or value in the distribution to any other. They are 
frequently used as measures of distances from averages, or 
sometimes from other points, that include specified per cents 
or other fractions of the whole number of cases. Their inter- 
pretation in this connection is usually based on the assumption 
of normal distribution and may be in error to the extent that 
this assumption is not fulfilled. 

In addition to absolute measures of variability there are rela- 
tive measures. The coefficient of variability is the only one that 
will be given. 


The range 

The range, or absolute range, is defined as the distance from 
the lowest case to the highest. It may, therefore, be obtained 
by simple subtraction. For example, the range of the eighteen 
salaries given on p. 19 is $1,900 — $1,250 = $650. 

In the case of a grouped distribution, the exact values of the 
two extreme cases are not known; hence the range cannot be 
determined so easily as for a simple or ungrouped distribution. 
Usually it is sufficiently close to state the range of such a 
distribution approximately by inspection and simple subtrac- 
tion. Thus for the distribution of 44 cases used in previous 
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chapters, which begins with the 180-class and ends with the 
380-class, the range would be stated as approximately 200. 

If a more accurate range than that just suggested is desired, 
it should be found by determining the values of the two 
extreme cases according to the assumption of distribution 
within the classes, as explained in Chapter II. In the dis- 
tribution just referred to, the lowest of the 7 cases in the 
180-class is assuiped to have a value of 

180 + 2 ^ = 181.43 

and the 1 case in the 380-class to have a value of 390; whence 
the range is 

390 - 181.43 = 208 . 57 . 

Since a change in the value of either of the extreme cases 
affects the range, and may affect it largely, the range does not 
possess high stability. In fact, it is the most unreliable of the 
measures of variability mentioned in this chapter. Largely 
for this reason it is not very frequently employed. 

The quartile deviation 

The quartile deviation, or semi-interquartih range, is one-half 
of the distance between the first and third quartiles. Its for- 
mula, therefore, is 

Q, or rarely QD, = 

Since the middle 50 per cent of the cases in a distribution lie 
between the first and the third quartiles, the quartile deviation 
is half the distance that includes the middle half of the cases. 
If the distribution is sjunmetrical, one-fourth of the cases are 
within IQ of the median in each direction, or one-half in both. 
The more asynametrical a distribution is, the more is the fraction 
of cases within IQ of the median liable to vary from one-half. 

The application of the formula may be illustrated by em- 
ploying the data used on p. 73 and elsewhere. There Qi was 
found to be 216 and Qz to be 275; whence 

275 - 216 
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Since this distribution is quite asymmetrical, it k not probable 
that just half the cases lie within 29.5 points of the median, 
242.22. Actually, despite the lack of symmetry, if the position 
of each case is determined according to the assumption regu- 
larly employed for grouped tabulations, it is found that almost 
exactly one-half the cases are between 


and 


242.22 - 29.5 = 212.72 
242.22 + 29.5 = 271.72. 


As implied by portions of the preceding paragraphs, the 
quartile deviation is associated with the median when with any 
measure of central tendency. This is because it is derived 
from measures, the quartile points, found by the same general 
method as the median. 


The 10-90 percentile range 

As its name implies, the 10,-90 percentile range, symbolized 
by Dio_ 9 o, is the distance from the tenth to the ninetieth per- 
centile. Therefore it includes the middle 80 per cent of the 
cases. Because it is based upon a larger fraction of the cases 
than is the quartile deviation, it is more reliable than that 
measure, and because it does not depend upon either of the 
single extreme measures, it is more reliable than the range. 
Despite this relatively high reliability, however, it has not 
come into common use. Also, it is rarely employed as a dis- 
tance from an average. 

The formula, 

Dio-90 = Pw — PlO, 

may be illustrated for the same data as were used for the 
quartile deviatio|^ For them, Pw has been found to be 336 
and Pio to be 192.57, hence 

Ao-90 = 336 - 192.57 = 143.43. 

The mean deviation 

The mean deviation, MD — also called the average deviation, 
AD, and the mean absolute deviation — ^is the mean of the devia- 
tions, with their signs neglected, from the mean, median, oi 
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mid-score. It is more often found about the mean than about 
the median or mid-score, but it is a minimum about either of 
the latter. In educational work the mean deviation is not 


used so often as the quartile, standard, and median deviations. 
In a normal distribution approximately 57.5 per cent of the 
cases are within \MD of the average. 

To compute the mean deviation about the mid-score of a 


simple series, the formula 


may be employed. In it ^2 


is the sum of all the cases larger than the mid-score and Si the 
sum of all smaller. For the ungrouped series in Table IV, on 
page 56, the mid-score is 23. Hence 


MDjftds — 

( 81 + 52 + 81 + 98 + 72 + 37 + 85 ) -( 18 + 13 + 15 + 15 + 12 + 14 + 11 ) ^ „ 


It may, of course, also be obtained by finding the deviation of 
each case from the mid-score, 23, summing these deviations, 
and dividing by N, here 15. In most instances, however, the 
former method is easier. 

The mean deviation about the mean of a simple series is 
usually best found by a formula easily applied in connection 
with the short method of finding the mean. It is 


The symbol | | indicates that the summation is absolute, 
that is, with signs neglected; Nt is the number of cases below 
the mean and Na the number above it. The correction, c, 
keeps its sign. Also, it must be small enough that no case 
falls between the assumed mean and the true mean. If it is 
not, another assumed mean must be taken. Jn this case, that 
is for the data on p. 56, c is satisfactory. S|d| is the sum of 
the positive deviations, 229, and the negative ones, 202. The 
Sfiumber of cases below the mean, 41.8, is 9; that above it, 6. 
Therefore, 


MDm 


229 + 202 + 1 . 8(9 - 6 ) 
15 


29 . 09 . 


The method of computing the mean deviation of the measures 
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in a grouped distribution differs from that just given only in 
detail. The formula includes the terms in that just used with 
additions. It is 

MD = S|/d| + c(N, - + (-^5 + c^)N„ . 

The new symbol, Nm, refers to the number of cases or frequency 
in the class containing the mean or median. + iVo + 
must equal N. The assumed mean or median must be so taken 
that c is less than +.5, but not less than —.5. Applying this 
formula to the data in Table V, on page 58, we have 

MD. - 48 + 40 + .182(21 -^4) + (.26 +.182-)9 


The method of finding the mean deviation about the median 
is the same. Since, however, the computation of the median 
does not involve the use of a correction, this must be found. 
It is 

"Md-AssMd 

Cutd — • 


The assumed median must be taken at the mid-point of the 
same class in which the median falls, in order that the value 
of c fall within the limits stated above. If, as is usually true, 
the mean and median lie in the same class, the same assumed 
mean and 0-deviation class is used for both, with the result 
that c is the only term in the formula that differs. Since this 
is true for the data in Table V, and since for them 


Cud 


242.22 - 250. 
20 


.389, 


the formula gives for them 


MDua 


48 + 40 - .389(21 - 14) + (-25 + -389^)9 ^ 

44 


This, it will be noted, is slightly less than MD„, in accord with 
the fact that the mean deviation about the median is a minimxun. 

Very careful workers sometimes apply a correction to the 
mean deviation of a grouped distribution as obtained by the 
formula above in order to make it more nearly correct for the 
actual or ungrouped cases. This correction consists in the sub- 
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traction of approximately the interval divided by ten times 
the number of classes. If n is used for the number of classes, 
the formula is 

MDctrr = MDm — 

For the example above, 


= 41.74- 

20 

= 41.56 

10 X 11 

= 40.40 - 

20 

= 40.22. 

10 X 11 


The standard deviation 

The standard deviation, also called the root-mean-square devia- 
tion, is rated by most statisticians as the most important 
measure of variability. It is the most reliable measure of 
deviation and is a quantity often needed for other statistical 
computations and procedures. It is associated with the mean 
if with any measure of central tendency. 

The standard deviation is the square root of the mean of the 
squares of the deviations from the mean of the distribution. 
In a normal distribution, about 68.27 per cent of the cases fall 
within one standard deviation of the mean, half or 34.13 per 
cent on each side. 

The formula for the standard deviation of a simple series is 

In this form it requires that deviations from the true mean be 
found. Unless the true mean is a “round” number, it is easier 
to take an assumed mean, as in the second part of Table IV. 
The formula in this case becomes 



Sometimes the actual scores rather than deviations are used, in 
which case it is 

<r = -y/ - M*. 

Table IX illustrates the computation of the standard devia- 



Computation of the Standard Deviation of an Ungrouped 
Series by Use of Scores and of Deviations 
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tion of the same simple series used in Table IV and elsewhere 
by both the last two formulas. In each case there are the same 
columns as in the table just referred to, with an additional one 
of squares added. The additional colimm in the part at the 
left contains the squares of the scores. Their sum, 41,141, is 
divided by N, 15, to give 2,742.73. From this 1,747.24, the 
square of 41.8, the mean, is subtracted, leaving 995.49. This 
is the square of the standard deviation; hence its square root, 
31.55, is the standard deviation. 

The additional column in the part at the right contains the 
squares of the deviations. Its sum is 14,981. Dividing this by 
N, 15, gives 998.73. The square of the correction, 1.8, is 3.24, 
and subtracting this from 998.73 leaves 995.49, the same value 
for <r* as was obtained previously. Therefore <r, as before, 
=31.55. 

The computation of the standard deviation of a grouped dis- 
tribution is shown in Table X, which repeats the work for the 
computation of the mean shown in the second half of Table V 
on page 58 with the omission of the last steps, which are 


TABLE X 

Computation of Standard Deviation 
OF A Grouped Distribution 


/ 

d 

fd 

fd^ 

380- 1 

+7 

+7 

49 

360- 2 

+6 

+ 12 

72 

340- 1 

+5 

+5 

25 

320- 2 

+4 

+8 

32 

300- 4 

+3 

+12 

36 

280- 0 

+2 

0 

0 

260- 4 

+1 

+4 

4 

240- 9 

0 

+48 

0 

220- 9 

-1 

-9 

9 

200- 5 

-2 

-10 

20 

180- 7 

-3 

-21 

63 

II 


-40 

44 [3W = 



II 

loo 

S* = 7.0455 



c = +.182 

c* = .0331 




«r» = 7.0124 




<r = 2.648 intervals 


i = 20. 

<r = 62.96 units 
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unnecessary here, and with certain additions. The first of these 
is the colximn headed /tP. Each entry therein is the product of 
the d and fd entries in the same row. Thus 7 X 7 = 49, the 
first entiy; 6 X 12 = 72, the second entry; and so on. Since 
each entry is the product of two like-signed numbers, all are 
positive. This column is totalled, giving 310. This sum is 
divided by N, 44, and the result, 7.0455, is labelled From 
this .0331,. which is the square of the correction, is subtracted, 
leaving 7.0124. The quantity thus obtained is the square of 
the standard deviation, so its square root is extracted. This, 
2.648, is the standard deviation in terms of intervals. Since 
it is regularly desirable to express it in the units employed, 
2.648 is multiplied by 20, the interval, giving a value of 52.96 
units. 

Although in the large majority of cases values of the standard 
deviation found as described above are employed, there are 
several corrections that may be applied to produce more nearly 
correct values. Generally their effect is so slight in comparison 
with other possible sources of error that they are neglected, but 
it seems well to mention some of them. The most important, 
often called Sheppard’s correction, approximately corrects the 
error introduced by assuming all cases to be at class mid-points. 
The formula is, in terms of intervals, 



and, in terms of units, 



For the example in Table X this gives 

= 2.648* — ^ - 2-632 intervals. 


<^corr 


or 


.^52 .96* - = 62.64 units. 

As here, corrected <t is always less than obtained <r. 
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If two series of measures of the same trait, such as two 
duplicate forms of a test, are available, the effect of variable 
errors upon the standard deviation is eliminated by using 
di X di, instead of cP, di being the deviation of the case in one 
series and di that in the other. This can easily be applied to 
ungrouped data, but with some difficulty to grouped ones. Its 
effect is to reduce the obtained value slightly if the two series 
correlate closely, more if they do not. 

If the coefficient of reliability, to be explained in Chapter IX, 
of a series of measures is known, the value of the standard 
deviation of true or perfectly reliable measures^ can be esti- 
mated by the formula, 

<^0) ~ 


in which <r„ is the true standard deviation and the co- 
efficient of reliability. 

If the standard deviation of a sample is taken as an estimate 
of that of the whole population from which the sample was 
chosen — a common practice — a more accurate value is obtained 
if the sum of the squares is divided by iV — 1 instead of by N. 
So doing naturally serves to increase the value of <r. Since the 
effect of using N — 1 rather than N becomes less and less as 
the value of N increases, many workers do not trouble to sub- 
tract the 1 if the number of cases is fairly large. 

It is sometimes convenient to be able to find the standard 


deviation of the combined score on two or more duplicate forms 
of a test when it is known for one. To do so the coefficient of 
reliability must also be available. For n forms of a test the 
formula is 



For only two forms this becomes 

ffj -b 


<ri + i = 


* Trae or perfectly reliable measures cannot actually be found, but certain facte 
about them may be determined. They will receive more attention later. 
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If the standard deviations of the forms are equal, these formulas 
reduce to 

<r« = ffiV n + n(w — l)rii 

and 

+ I = <riV2(l + rii), 

respectively. The average standard deviation of n forms is 

and that of only two forms is 




(1 + rii). 


The variance 

The variance, or mean square deviation, is the square of the 
standard deviation. It has no regularly employed symbol 
other than <r*. It may be employed as a measure of variability 
and is receiving increasing use, especially in connection with the 
analysis of variance and of covariance, topics too broad in scope 
to receive adequate treatment in this volume. Also, it is 
utilized in the interpretation of coefficients of correlation. For 
the data in Table X, it is 52.96*, or 2,804.76. 

The median deviation 

The median deviation (MdD), rarely called the probable 
deviation, is just what its name implies, the median of the 
deviations about the mean. Hence half the deviations from 
the mean exceed it and half are less; in other words, half the 
cases in a distribution are within 1 MdD of the mean and half 
are farther from the mean. 

The approved method of finding the median deviation is not 
the apparently direct one of tabulating the deviations and 
finding their median, but is rather based upon the fact that for 
a normal distribution it is a fixed fraction, .6745, of the standard 
deviation. Applying this gives AfdZ) = .6745 X 31.55 = 21.28 
for the data in Table IX and .6745 X 52.96 = 35.72 for those in 
Tkble X. The formula MdD = .6745<r is used regularly. 
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despite the fact that the more a distribution departs from 
normality, the greater the probable inaccuracy resulting from it. 

For a normal distribution the quartile and median deviations 
are the same and for any other shape not extremely asymmetri- 
cal or irregular their values are likely to differ only slightly. 

Unfortunately the term probable error (PE) has come to be 
very commonly applied to the median deviation. If used at 
all, it should be employed only when the cases dealt with are 
actually errors. In this sense it is explained and used later 
in the book. 

Comparison of measures of variability 

Figure 17 presents a normal curve with the distances shown 
that are covered by the measures of variability so far discussed. 
The range and 10-90 percentile range extend the whole distance 



Fig. 1 7. Normal Curve with Measures of Variability 


between their respective limits; whereas the others extend their 
respective distances in both directions from the mean or 
median, which, of course, coincide in a normal curve. The 
largest is the range, which includes all the cases; next in size 
is the 10-90 percentile range, which takes in the middle 80 per 
cent; then comes the standard deviation, which when laid off 
in both directions from the mean embraces 68.27 per cent; next 
is the mean deviation, which similarly includes 57.6 per cent 
about either the mean or the median; finally the median 
deviation includes the middle 50 per cent about the mean and 
the quartile deviation the same about the median. 

Table XI shows the computation and relative sizes of these 




TABLE XI 

Computation of Measures of Variability 
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measures for a set of grouped data. Their calculation follows 
the formulas and procedures already explained. Since the dis- 
tribution is skew, but not greatly so, the relationships existing 
among the measures of variability and their interpretation are 
not just those assumed on the supposition of normality, but 
do not differ greatly from them. For example, the quartile and 
median deviations are not exactly equal, as they would be for 
a normal distribution, but differ slightly. 

Figure 18 graphically represents the data in Table XI, in 
histogram form, and their measures of variability. Since the 



Fig. 1 8. Graphic Representation o^ Data in Table XI, Showing 
Measures of Variability 


distribution is not far from normal, the areas included by the 
latter are approximately the same as those shown in Figure 18. 

In the discussions of the various measures of variability, the 
per cents of the cases in a normal distribution included by 
distances from an average equal to them have been stated. 
Not only is this known for a distance equal to each such 
measure, but also for any distance expressed in terms of any 
of them except the range. The tables in the Appendix give 
this, among other facts, for distances expressed in terms of 
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standard and median or quartile deviations; but it seems in 
place to present here a brief table showing the per cents of 
cases included within distances from the average equal to even 
half and integral values of each of the five measures. Table XII 

TABLE XII 

Per Cents of Cases in a Normal Distribution Included 
Within the Given Distances from an Average 


Number of 
Deviations 

MdD or Q 

MD 

<r 

Dio-90 

.5 

26.40 

31.01 

38.30 

80.00 

1.0 

50.00 

57.51 

68.27 

98.96 

1.5 

68.84 

76.86 

86.64 

99.99 

2.0 

82.27 

88.95 

95.45 

100.00- 

2.5 

90.82 

95.39 

98.76 

100.00- 

3.0 

95.70 

98.33 

99.73 

100.00- 

3.5 

98.19 

99.48 

99.95 

100.00- 

4.0 

99.30 

99.86 

99.99 

100.00- 

4.5 

99.76 

99.97 

100.00- 

100.00- 

5.0 

99.93 

99.99 

100.00- 

100.00- 


gives these facts for from .5 to 5.0 of each. The entries in its 
second column, for example, show that 26.40 per cent of the 
cases in a normal distribution, or of the area under a normal 
curve, fall within .5MdD or Q of the average, 50.00 per cent of 
the cases within l.OMdD or Q of the average, 68.84 per cent 
within l.SMdD or Q of it, and so on down until 99.93 per cent 
are within B.OMdD or Q of the average. To find the per cents 
within the given distances of the average in one direction only, 
halves of these entries should be used. Thus 13.20 per cent 
of them are within .5MdD or Q of the average in one direction, 
and so on. The last three coliunns give similar per cents for 
other measures of variability. 

It is also sometimes useful to know the numerical relation- 
ships existing among the measures of variability themselves. 
For a normal distribution these are as follows : 

Q = .2632Dio- 90 or .8453MD or .6745<r or l.OOOOMdD 

Dio -90 = 3.8001Q or 3.2124MZ) or 2.5631<r or 3.8001MdD 

MD = 1.1829Q or .3113Dio_9o or .7979<r or 1.1829AfdD 

tr = 1.4826Q or .3902D,o-9o or 1.2533MD or lA826MdD 

MdD - l.OOOOQ or .2632Ao-9o or M5SMD or .6745<r 
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For distributions only moderately skew they do not differ 
greatly from these figures. 

The range is not included above because, even for a normal 
distribution, its relationship to the other measures of variability 
is not constant. The reason for this lack of constancy is that 
a normal distribution theoretically has a range of infinity and 
there is no accepted basis of assuming a finite value for it that 
yields the same result for all normal distributions. Instead, 
the usual basis upon which such an assumption is made is such 
that the value given the range depends on the number of cases. 
The assumption is that the range is the distance needed to 


include (N — .5) cases, or 


N - .5 
N 


of the area under the curve. 


That is, if there are 18 cases in the distribution, the range is 
assumed to be the distance necessary to include the middle 17.5 

17 5 

of them, or the middle of the area under the curve; if 


there are 45, it includes the middle 44.5 of them; and similarly 
for any value of N. 

By finding the distance, in one direction from the average. 


that includes 




of the area under the curve and dou- 


bling it, the assumed range for any number of cases can be 
found. This has been done for a few representative niunbers 
and the results presented in Table XIII. This table shows, for 
example, that the assumed range of a normal distribution of 
10 cases is approximately 5.8Q, 1.5Dio-9o, 4.9JlfI), 3.9<r, or 
5.8MdD. In connection with this, as in many other situations, 
there are usually only small errors involved in applying the 
same figures and interpretations to moderately skewed distribu- 
tions as are applicable to normal ones. 


The relative merits and use of measures of variability 

The range is easily determined, readily understood, and may 
be used with any average. On the other hand, it is the least 
reliable and takes no account of the general shape of the 
distribution. On the whole, it is the least valuable measure of 
variability, but may often be used to supplement others, 
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TABLE XIII 

Assumed Ranges of Normal Distributions 
Containing Various Numbers of Cases 


N 

Q 

Dio- 90 

MD 

<T 

MdD 

10 

5.8 

1.5 

4.9 

3.9 

5.8 

25 

6.9 

1.8 

5.8 

4.7 

6.9 

50 

7.6 

2.0 

6.5 

5.2 

7.6 

100 

8.3 


7.0 

5.6 

8.3 

200 

8.9 


7.6 

6.0 

8.9 

500 

9.7 

2.6 

8.2 

6.6 

9.7 

1,000 

10.3 

2.7 

8.7 


10.3 

5,000 

11.5 

3.0 

9.8 

7.8 

11.5 

10,000 

12.2 


10.3 

8.2 

12.2 

100,000 

13.7 


11.6 

9.3 

13.7 

1,000,000 

15.1 


12.7 

10.2 

15.1 


The quartile deviation is a somewhat makeshift measure, 
rather unreliable, not found directly about an average. It is 
fairly easy to calculate and understand, although more difficult 
than the range in both these respects. It should be used with 
the median. 

The 10-90 percentile range is high in reliability as compared 
with other measures obtained from -ile points, but lower than 
the mean and standard deviations. It is a distance extending 
in both directions from the center, but, as Q, may be used with 
the median. It is also similar to the quartile deviation in 
relative ease of computation and understanding, but is less 
frequently employed. 

The mean deviation is of above-average reliability. It is 
somewhat difficult to compute, but readily understood. It is 
based on all the cases. The mean deviation is unique in that 
it may be found about either the mean or the median. 

The standard deviation is most favored by statisticians, but 
some of their reasons have slight pertinence to its use in most 
educational statistical work. It is a function of the normal 
curve, being the distance from the mean to the point of inflec- 
tion, that is, the point at which a line tangent to the curve 
changes direction. It is the most reliable measure of variabil- 
ity, is based upon all the cases, is useful in many other statistical 






100 


MEASURES OF VARIABILITY 


procedures, and has desirable algebraic relationships and 
properties. On the other hand, however, it is relatively diffi- 
cult to compute and the least readily comprehended by persons 
without statistical training. It gives more weight to the 
larger deviations than do other variability measures. This 
may or may not be desirable. 

Since the median deviation is regularly found by multiplying 
the standard deviation by a constant, it shares many of the 
latter’s merits and weaknesses. It is relatively easily under- 
stood and remembered because of the 50-50 division point 
which it represents. 

Of these measures, the standard and quartile deviations are 
far more frequently employed in dealing with educational data 
than are the others. The mean deviation never has been so 
used to a great extent and the other two appear to have 
declined markedly in popularity during recent years. 


The coefficient of variability 

All the measures presented in the preceding sections are 
absolute; therefore they cannot well be employed in comparing 
the variability of different sets of data except in special cases. 
The coefficient of variability or of variation may be used to com- 
pare the relative variability of different distributions. 

Various formulas for the coefficient of variability have been 
suggested. All involve the comparison of a measure of varia- 
bility with a measure of central tendency. This is based on 
the assumption that the best relative measure of variability is 
one that compares absolute variability with the size of the 
measures themselves and that a measure of central tendency 
is the best measure of their size. The formula most commonly 

employed is C of F, or just V, = 100^* The multiplier 100 is 

introduced merely to yield a result that is a whole number. 
For the data often employed previously this formula gives 


F= 100 


52.96 

253.64 


= 21 . 


The reason why the coefficient of variability is not a thor- 
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oughly satisfactory measure is that in many instances the scor- 
ing system employed does not have a true zero point or equal 
units. If these two conditions do not hold for a set of data, 
this or any other similar measure may give misleading informa- 
tion. Despite this weakness, it is the most frequently used 
measure of relative variability. 

The measurement of skewness 

Since the formulas most often employed to measure skewness 
and kurtosis are based upon measures of central tendency and 
of variability, it appears appropriate to include them here. 
The best evidence as to the degree of skewness of a distribution 
is generally that obtained by inspecting its graphic representa- 
tion, but a single numerical index is frequently useful. The 
most commonly employed formula for skewness is 

M - Mo 3(M - Md) 

Sk = or -7 

<r . a 

the latter and generally more convenient form being secured by 
substituting for Mo its formula, 3Md — 2M. A positive value 
indicates that the mean of the data exceeds both their median 
and their mode, that there is greater bunching of measures in 
the direction of low scores and greater extension or “tailing out” 
in that of high ones. A negative value shows just the opposite. 
The maximum value is ±3.00, but one greater than ±1.00 
indicates extreme skewness or asymmetry. 

Sometimes another formula, generally less satisfactory than 
the one already given, is employed. It is 

(Q, -Md)- (Md - Q.) 

Sk 0 , 

which by combining terms after removing parentheses may be 
changed to the more convenient form, 

Sk g 

Positive and negative values from it have the same general 
interpretation as those from the other formula. It tends to 
yield smaller values, which cannot exceed ±2.00 and do not 
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often exceed ±1.00. It appears to be less sensitive to small 
differences in the shapes of distributions. Except in rare 
instances, almost always when both are quite near zero, the 
results from the two formulas applied to the same data have 
the same sign. 

For the 44 cases employed throughout these chapters the 
first formula gives 

_ 3(253.64 - 242.22) _ 

52:96 

and the second 


216.00 + 275.00 - 2 X 242.22 


= . 22 . 


Thus both indicate positive skewness, but the difference be- 
tween the values they yield is somewhat greater than is usually 
the case. 


The measurement of Icurtosis 


The general procedure involved in the simpler measures of 
kurtosis is to compare a measure of the spread of the central 
portion of a distribution with that of a larger portion of it. 
A simple and frequently employed measure of this sort is 



If its value is .2632, the distribution is mesokurtic 


or normal; if it is larger than that, the distribution is platy- 
kurtic; if smaller, it is leptokurtic. It is always positive and 
ranges from .00 to .50. If it is .00, at least the middle 50 per 
cent, but not the middle 80 per cent, are bunched at a single 
point; if it is .3125, the distribution, or at least the middle 
80 per cent of it, may form a perfect rectan^e; if it is .50, all 
the cases between the tenth percentile and the first quartile are 
bunched at one point and all those between the third quartile 
and the ninetieth percentile at another. 

Another less-used measure is Ku = H this is 1.00, 

the data form a mesokurtic distribution; if it is larger than 
1.00, they form a platykurtic one; if less than 1.00, a lepto- 
kurtic one. Its lower limit is .00 and its upper limit infinity, 
but it very rarely exceeds 2.00. 
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Applied to the data so often used, these formulas give 


and 


Ku = 


29.5 


143.43 


= .21 


29.5 

37.2 


= .70, 


respectively. Since the distribution is far from symmetrical, 
measures of kurtosis are of doubtful significance for it, but, 
insofar as they are, the results of both formulas indicate that 
it is leptokurtic. 


EXERCISES AND PROBLEMS 

1 . Compute the range, the quartile deviation, the 10-90 percentile 
range, the mean deviation around the mean and the median, the 
variance, the standard deviation, the median deviation, and the 
coefficient of variability of the data in each part of Exercises 1, 3, and 
6 on pages 69 and 70. 

2 . Compute skewness and kurtosis, by both formulas for each, 
for the data in each part of Exercises 1 and 3 on page 69. 
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CHAPTER VII 


The CoefRcient of Correlation 


The meaning of correlation 

Correlation may be defined as the amount of relationship 
between paired facts or of the tendency of two or more vari- 
ables, or attributes, to concomitant variation. To study and 
find it we must have available two measures of each of a number 
of individuals. We cannot correlate data from two separate 
groups of cases. 

Since it is often helpful to know the relationship between such 
characteristics as height and weight, intelligence and achieve- 
ment, training and success, quality of work in English and in 
foreign language, and numerous others, correlations are very 
frequently computed. Although correlation procedures have 
sometimes been employed when they were not appropriate, 
they constitute one of the most useful approaches to many 
problems that arise in the field of education. 

There are numerous measures of correlation. Some may be 
either positive or negative ; others are always positive. Positive 
correlation signifies that as one of the variables or attributes 
increases, so does the other; negative that as one increases, the 
other decreases. Most of the measures have values ranging 
between .00 and 1.00 or —1.00 and 4- 1-00. Some measure 
straight-line or rectilinear relationship only; some, curvilinear. 
Most are for use with variables, but some may be employed 
with attributes, or with one attribute and one variable. A 
majority are concerned with two variables or attributes; a few, 
with more than two. 

An important point in connection with all correlation is that 
in itself it never shows causation. If two series of data are 
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perfectly correlated, this fact does not prove that either is a 
cause of the other. It may be that one is, or that some third 
factor causes both, but proof of such causation must be ob- 
tained by other means than correlation. Correlation does 
suggest the possibility of causal relationships, but no more. 

The nature of the coefficient of correlation 

The coefficient of correlation, often called the product-momerU 
coeficient, is the most commonly used measure of relationship. 
It is a measure of straight-line relationship of variables, ranging 
in value from —1.00 for perfect negative relationship through 
.00 for none or pure chance to -fl.OO for perfect positive. If 
each measure in one series is connected with the corresponding 
one in the other by a uniform first-degree algebraic equation, 
the coefficient of correlation between the two series is 1.00, 
+ or — according to whether the two series increase together or 
one increases as the other decreases. Another way of stating 
this is that two series of data correlate perfectly whenever each 
item in one can be obtained from the corresponding item in the 
other by adding or subtracting the same amount, multiplying 
or dividing by the same factor, or any combination thereof, 
throughout the scries. 

In order to illustrate more concretely the significance of 
various values of the coefficient of correlation, Table XIV has 
been prepared. Each part contains two examples of the value 
of the coefficient of correlation indicated by its heading. In 
each example the first column has been headed X; the second, 
Y. The two examples in Part A are of perfect positive cor- 
relation. In the first each value of the second variable is just 
5 less than the corresponding one of the first. In other words, 
X = y + 5 and Y = X — 5, without exception. In the 
second example of Part A the uniform relationship is not so 

Y 

apparent, but nevertheless exists, since invariably X = — 4 

and Y = 2X -1- 8. The examples in Part B show a strong 
tendency for high values of X to be paired with high values 
of Y, but no uniform relationship exists. In C high values of 
one variable are as. likely to be associated with low as with 
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TABLE XIV 

Examples or Various Values of the 
Coefficient of Correlation 


Part A 

Part B 

Part C 

Part D 

Part E 


Perfect 



High 






High 



Perfect 



Positive 



Positive 







Negative 



Negative 


X 

Y 

X 

Y 

X 

Y 

X 

Y 

X 

Y 

X 

Y 

X 

Y 

X 

Y 

X 

Y 

X 

Y 

95 

90 

26 

60 

98 

99 

24 

13 

75 

54 

36 

54 

42 

17 

57 

18 

94 

29 

31 

16 

90 

85 

38 

84 

97 

94 

22 

17 

75 

61 

85 

24 

41 

19 

21 

45 

86 

33 

31 

16 

85 

80 

15 

38 

95 

93 

35 

19 

72 

42 

77 

68 

37 

22 

43 

28 

84 

34 

33 

15 

80 

75 

21 

50 

94 

92 

15 

9 

70 

31 

41 

73 

36 

21 

52 

24 

84 

34 

35 

14 

75 

70 

33 

74 

91 

87 

17 

7 

70 

55 

28 

66 

35 

24 

38 

33 

80 

36 

39 

12 

70 

65 

30 

68 

89 

90 

21 

12 

69 

27 

52 

48 

31 

27 

36 

35 

78 

37 

41 

11 

65 

60 

28 

64 

88 

91 

29 

15 

64 

33 

62 

53 

30 

33 

27 

40 

72 

40 

43 

10 

60 

55 

29 

66 

88 

86 

38 

21 

61 

44 

53 

61 

26 

38 

29 

42 

70 

41 

47 

8 

55 

50 

18 

44 

85 

83 

17 

9 

58 

34 

34 

42 

24 

37 

59 

16 

70 

41 

47 

8 

50 

45 

24 

56 

82 

79 

9 

5 

55 

60 

66 

70 

24 

40 

46 

23 

68 

42 

49 

7 

45 

40 

31 

70 

78 

76 

15 

7 

51 

48 

21 

32 

23 

42 

31 

37 

64 

44 

53 

5 

40 

35 

37 

82 

71 

73 

23 

13 

46 

50 

43 

79 

18 

46 

45 

26 

62 

45 

55 

4 


high values of the other, thus indicating that the coefficients 
of correlation are low, near .00. Part D presents the reverse 
of B, a marked tendency for high values of one to be with low 
values of the other. Finally in E arc two examples of perfect 
negative correlation. In the first X always = 152 — 2Y and 
X 

Y = 7% — and in the second X always = 63 — 2Y and 

Y = 31.5 - |- 

Since the coefficient of correlation measures rectilinear rela- 
tionship, a perfect or 1.00 value thereof means that if the data 
are plotted they lie on a straight line. If the coefficient is 
+1.00, this line extends from lower left to upper right on a 
conventional graph, in harmony with the coordinate axis 
system, and if it is —1.00 the line extends from upper left to 
lower right. If all the data lie on a straight line parallel to 
either axis, all the values of one variable are the same, and the 
formula for the coefficient gives an indeterminate value which 
can be shown to equal .00. As the coefficient becomes pro- 
gressively less than 1.00, the distribution of the data departs 
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more from a straight line, until as it approaches .00 the shape 
assumed approaches a circle, a rectangle, or some irregular 
form. 

The interpretation of coefficients of correlation will be dis- 
cussed later, but it appears appropriate here to state that from 
at least two standpoints the coefficient is a miniTniim measure 
of relationship. Since it measures rectilinear relationship only, 
if additional relationship of a curvilinear nature exists, the total 
relationship is greater than that which is measured. Also, if 
there are errors in the data, their effect is usually to lower the 
value obtained, so that it is less than the actual or true value. 

Sometimes the fact that the measuring scale used is composed 
of unequal but systematically changing units produces artificial 
curvature. In such a case the skew distribution caused by 
this condition should be transformed into a normal one by the 
method given in Chapter XIII and then the coefficient of 
correlation determined. 

Computing coefficients of correlation of ungrouped series 

There are many forms in which the formula for the co- 
efficient of correlation may be given. All are reducible to the 
same expression, hence yield the same result for the same data. 
Which to employ is a matter of convenience and preference. 
Only the few which seem most frequently useful will be 
presented here. 

The simplest form is 

N<Tx(Ty 

or, dividing through by N, 

N 

r = — 

O’xO’ y 

In this r is the symbol for the coefficient of correlation, x and y 
are the deviations of the measures from their respective means,‘ 

^ Small letters, the same as the large ones used to denote the measures them- 
selves, may be employed instead of d for deviations from the mean. Thus x and 
dx mean the same, y and dy, and so on. 
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and 2, N, and <r are the same as in previous chapters. If, as 
usually, assumed instead of true means are used, it becomes 

'S.xy 

'Lxy - N 

r = — -* or 

Since the c’s and (t’s are not actually needed, the formula is 
sometimes written in terms of the expressions from which they 
are obtained rather than of them, thus: 

^ _ N'Sxy — SxSy 

^ ~ - (2x)2] [N2y^ - (Zy)*] 


or 


^xy - 


'Lx'Ly 

N 



If the assumed mean is taken as 0, a practice sometimes con- 
venient, the formula is in terms of actual scores, X and Y, 
rather than of deviations, and is 

2Xy _ w 

XXY - NMMy N * ^ 

r = ^ or 

JS<Tx<ry (TlCry 

or 

ZXY - NMxMy 
V(2l:2 - NMx^XSY^ - NMy^) 


if means are employed, and 


4 - ^’) 

if they are not. 

The next three tables. Tables XV, XVI, and XVII, present 
the three most common variations of the formula applied to 
ungrouped series of data. They differ in where the assumed 
mean is taken. The entries in the first column of each are 
JQ’s, those in the second per-cent marks on a test. In Table XV 
the scores themselves are used directly, which is equivalent to 
assuming the mean at zero. This is frequently known as the 
Ayres Method. The first two columns contain the scores. 



TABLE XV 

Computation of Coefficient of Correlation of Ungrouped Scores Directly 
FROM Scores Themselves 
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J Ot>-OcD’^00’^C00b0001^*-HQ'^C0C^C00b ( 

^ OS i-t o i-n-f CO «0 CO »-• lO «) 0> 0> O N* tH < 
10qT-iOSOSOOSQOOOt^QOt»t»OCO(Ot«0'^iO ^ ' 


COO^^O’-^OS»OCO O OSOS 
b-O00MQ-i<C^CM05 00 OO 
t^-^rHCD^WCOOO i-t COOl 
»0 CO »0 Tti O »0 CO Tfi o CO CO • 
<M .. .. 


<NC 0 l>-C 0 (M 05 C 0 Tt<»-ii-<OQCD»-HC 0 c 6 CN'^ 05 CJ 

CS|i-iOO’-HCSOSCO'«t<OOOOCMOOcOCNTl^M<0>t^COki 

OOt^iO-^t<CO<Ni-t»-HOOOOSOOSOOQOl^t^t^C 




C<l Ni lO OS »0 00 CO »-< *-< CO Q »C O OS CO lO ' 
OS OS OS 00 00 00 1>- 00 00 OO !>• t>» t'* »0 CO , 


»OwcOOS»O'^ 00 l^'^:^ 1 -«OOO^C^OIOSOOCO^ »o ^ 
COCOC<| 1 -|»-H 1 -HOOOOOOSOSOOSOSOOOOOOOO 00 o 
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labelled X and Y, the next two, headed X* and F®, their 
squares, and the last or XY column their products. All 
are totalled, and the sums divided by N, here 20. The results 
for the first two columns are the means, 104.25 for the X scores 
and 78.80 for the Y scores. The steps at the bottoms of the 
next two colxmins are the same as on page 89, with the appar- 
ent but not real difference that M® instead of c® is subtracted 
to give <r®. It is not real because in this case, with the assumed 
mean at 0, M = c. Therefore the two standard deviations, 
14.83 and 9.98, are found as on the page just mentioned. Finally 
from the sum of the entries in the last, or XY, column divided 
by N the product of the two means is subtracted. Here 
SXF = 166,774, which divided by 20 gives 8,338.70. This 
less MJdy, 8,214.90, leaves 123.80, labelled p, which is the 
symbol sometimes used for the numerator of the formula. 

At the foot of the table r is found by two variations of the 
formula. First, using 


,, ,, 

jy ~ M xM y 
r — > r = 

<7 a:<T y 


123.80 

14.83 X' 9.98 


Second, employing the next to the last form. 


+ .84. 


166774 - 20 X 104.25 X 78.80 


r = 


v'(221761 - 20 X 104.25®) (126180 - 20 X ”78.80^) 


= -f- .84. 


Any of the others may also be used, with the same result. 

It is worth noting that the numerator of the final fraction 
obtained from the second variation of the formula is N times 
that obtained from the first variation employed. Thus here, 
2,476 = 20 X 123.80. Likewise the denominators exhibit the 
same relationship. Here, 2,959.79 is not exactly 20 X 148.00 or 
2,960 because some decimals have been dropped. Furthermore, 
each <r equals the square root of the corresponding term in 
parenthesis in the denominator divided by N. In this case, 


14.83 = yj- 
9.98 = yj- 


221761 - 20 X 104.25® 
20 


126180 - 20 X 78.80® 
20 


and 
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The same relationships also hold for the data in Tables XVI 
and XVII. 

Table XVI presents the same computation with the use of 
assumed means near the smallest scores. In this method each 
assumed mean is taken as the smallest score in the series or 
some “round” number slightly smaller. In this case, since the 
smallest X value is 81, the assumed mean of the X’s is taken as 
80 and, since the smallest Y is 55, the assumed mean of the 
F’s as 50. The column headed x contains the remainders after 
subtracting 80 from each X score, and that headed y the re- 
mainders after subtracting 50 from each Y score. The next 
two contain the squares of the x’s and j/’s, and the last contain 
their products. The same steps are carried out as for the data 
in Table XV. giving the same result. 


123.80 

14.83 X 9.98 


+ .84. 


Also, the last of the formulas above for use with deviations is 
employed, with the totals of the five columns. It gives 


16444 - 


r = 


485 X 576 
20 




.8580 -f-’) 


= + .84. 


The advantage of the method just described is that the sizes 
of the numbers dealt with are much reduced. Whether this 
reduction is suflScient to pay for the additional step of subtract- 
ing the assumed means from the actual scores depends upon 
their relative sizes and the relative facility of the computer in 
handling niunbers of different sizes. If the range of scores is 
much greater than the smallest one, so that the ratio of reduc- 
tion by subtracting assumed means is small, so doing is not 
worth while. For example, if the X scores ranged from 81 to 
535 instead of to 135, subtracting 80 would hardly be worth 
the trouble. Similarly, if they ranged from 21, instead of 81, 
to 135, subtracting 20 would not pay. 

Table XVII illustrates the method in which the assumed 
means are taken close to the true means. This has the advan- 
tage of giving even smaller munbers than the previous method. 
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but the disadvantage of introducing the minus signs. It is, 
except for the last column and the final steps, the same as the 
short method of computing the mean, given on page 66, and 
of finding the standard deviation, on page 89. In form it is 
just the same as that in Table XVI. The same two variations 
of the formula are employed, with the same result. 

Sometimes all the scores in a series can be evenly divided by 
the same factor. If so, such division may be worth while. It 
is merely a question of whether the reduction in size is sufficient 
to outweigh the extra labor of the additional step. If all end 
in one or more O’s which may be dropped, it is evidently worth 
doing so, but otherwise it may or may not be. 

Computing coefFicIents of correlation of grouped series 

The process of computing coefficients of correlation of 
grouped series is the same in principle as that for ungrouped 
series, but must be carried out in different form. The same 
principles that determine whetherjor not a single series should 
be dealt with in ungrouped or grouped form apply in cases of 
correlation of two series. Generally, if there are more than 30 
or 40 cases, they are grouped; if there are less, they are not. 
The resulting table is known as a correlation table, a table of 
double entry, or a double frequency table. It is constructed in 
harmony with the coordinate axis system. The scale for one 
series, usually the first, is laid off upon the X or horizontal axis; 
that for the other, upon the Y or vertical axis. A table is thus 
formed, composed of a number of cells or compartments, each 
of which contains a number showing how many cases have the 
Y value of the row and X value of the column which intersect 
to form the cell. The X values are commonly indicated by 
figures or other symbols, called column headings or captions, at 
the top of the table; the Y values, by others, called stubs, at 
the left. 

To prepare such a table, the worker first should examine each 
series and, in accord with the suggestions given in Chapter II, 
decide how to classify it. One additional consideration is that 
the numbers of classes in the two series correlated should be 
approximately the same. Having done this, he should prepare 
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the table, insert the captions and stubs, and tabulate the data. 
Probably the most convenient method of doing the latter is to 
place a tally mark, made with a fairly soft pencil, in the proper 
square for each pair of numbers. After doing this once, he 
should repeat the process and compare results, to eliminate 
probability of error. If results agree — or after they are cor- 
rected, if they do not — an entry showing the number of tally 
marks in each cell should be entered therein. If entry is made 
with ink or a hard pencil, the tally marks can easily be erased 
without disturbing it, and a clean, neat table left without the 
labor of preparing one and then recopying it. 

To illustrate the construction of a correlation table, and later 
the computation of the coefficient thereof, the data in Table 
XVIII may be employed. The first series are the total scores 

TABLE XVIII 

Scores of a Class on a Series of Short Tests 
AND on the Final Examination* 


Short 

Tests 

Final 

Short 

Tests 

Final 

Short 

Tests 

Final 

Short 

Tests 

Final 

Short 

Tests 

Final 

336 

130 

291 

116 

259 

115 

200 

95 

128 

76 

321 

114 

272 

108 

186 

96 

198 

96 

353 

126 

226 

102 

258 

111 

222 

95 

209 

102 

267 

112 

270 

109 

202 

93 

174 

79 

143 

78 

326 

119 

333 

131 

250 

109 

315 

119 

126 

75 

267 

104 

351 

132 

251 

105 

230 

98 

291 

118 

182 

94 

274 

106 

339 

125 

215 

100 

325 

128 

252 

100 

175 

94 

269 

100 

317 

129 

235 

106 

175 

95 

202 

99 

165 

94 

296 

124 

238 

101 

263 

118 

241 

101 

237 

116 

292 

109 

183 

106 

184 

107 


* These scores form the basis of Tables XIX, XX, and XXI. 


of 50 students on a series of short tests; the second, their scores 
on the final examination. For example, one student had a total 
of 336 points on the short tests and made 130 on the final, 
another had 321 and made 114, and so on. 

Table XIX shows the initial form of a correlation table for 
the data in Table XVIII. Classes with an interval of 20, 
be ginn i ng at 120, have been chosen for the total scores and 
with an interval of 5, beginning at 75, for the final examination 
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scores. A tally mark for each pair of scores is then placed in 
the proper cell. Thus for the first pair, 336 and 130, such a 
mark has been placed in the cell formed by the intersection of 
the 320-column and the 130-row; for the next, 321 and 114, in 

TABLE XIX 

First Step in Construction op Correlation Table 
Containing Data in Table XVIII 

Final 
Exam. 

Score ■ 

IZO- 

125 - 

120 - 

115 - 

110 - 

105 - 

100 - 

95 - 

90 - 

85 - 

80 - 

75 - 


Total Score on Short Tests 


120 - 


140 - 


160 - 


180 - 


200 - 


220 - 


240 - 


260 - 


280 - 


300 - 


320 - 


340 - 


// 


// 


// 


// 


// 


// 


// 


// 


// 


// 

/ 


// 


// 


// 


/// 


// 


that made by the intersection of the 320-column and the 110- 
row; and so on imtil all are entered. This has been checked 
and the final result is as shown in the table. 

These data have been entered in the next two tables, but with 
figures instead of tally marks in the cells. Also, they have been 
totalled both ways and the results entered in the total or fre- 
quency column and row. When this has been done, they are 
ready for the computation of the coefficient of correlation 
between the two series. In Table XX this has been done by 
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the use of assumed means just below the lowest scores, that is, 
by the so-called Ayres Method. As applied to a correlation 
table, it provides that the deviations assigned the classes begin 
with 1 for the lowest class and increase by one per class until 
the largest equals the number of classes. In this instance, 
since each variable is grouped in twelve classes, both x and y 
range from 1 up to 12. 

The next two rows, labelled fx and fx^, and the next two col- 
umns, headed /j/ and/y*, are the same as in the computation of 
the standard deviation shown on page 90, except that there 
an assumed mean near the true mean is used. In other words, 
each fx entry is the product of the / and x entries in the same 
column or row, and each fx^ entry the product of the cor- 
responding X and /a: entries, and similarly for y. 

The Sx column is new. Each entry therein is the sum of the 
products of the entries in that row of the table times the x values 
of their respective columns. Thus, for the first row, 

2 X 11 + *1 X 12 = 34; 

for the second, 

1 X 10 + 2 X 11 -I- 1 X 12 = 44; 

and similarly for the others. The total of the Sx column, here 
341, should equal the total of thc/x row. For the Sxy col umn 
each entry in the Sx column is multiplied by the y value of the 
same row. Thus 12 X 34 = 408, the first such product; 
11 X 44 = 484, the second; and so forth. This column is 
then summed, the result, in this instance, being 2,701. 

The two rows below the wavy line are not necessary, but 
serve to check part of the work. The first, labelled Sy, con- 
tains entries computed similarly to those in the Sx column, but 
vertically rather than horizontally. For example, the Sy entry 
for the 180-column is 28, the sum of 

2X7-1-2X6-1-1X4. 

The total of the Sy row should equal that of the fy column. 
Here both arc 343. Each Sy entry is multiplied by the cor- 
responding X, the result entered in the Sxy row, and the total 
found. It should, of course, equal that of the Sxy column, in 
this case 2,701. 
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TABLE 
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Correlation Table with Assumed Means Below Lowest Scores 


320- 

340- 

f. 

2 

1 

3 

2 

1 

4 



1 

1 


7 

1 


3 



9 



8 



7 



4 



0 



0 



4 

6 

2 

50 

11 

12 


66 

24 

341 

726 

288 

2743 

63^ 


343 

693 

276 

2701 


fy 



Xxy 

36 

432 

34 

408 

44 

484 

44 

484 

10 

100 

9 

90 

63 

567 

60 

540 

24 

192 

26 

208 

63 

441 

61 

427 

48 

288 

52 

312 

35 

175 

33 

165 

16 

64 

15 

60 

0 

0 

0 

0 

0 

0 

0 

0 

4 

4 

7 

7 

343 

2747 

341 

2701 


— — ft »9 r — — r 

fji = ?Z1? = 54.8600 SI = = 54.9400 

50 50 


c? = 46.5124 
a? = 8.3476 
(Tx = 2.889 

2701 


cj - 47.0596 
<rl = 7.8804 
= 2.807 


^ = 54.0200 
50 

CxCy — 46.7852 
p = 7.2348 

7.2348 _ 7.2348 _ g. 

^ ~ 2.889 X 2.807 8.1094 
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After the totals of the fx and fx^ rows, of the fy and fy^ 
columns, and of Zxy have been obtained, the coefficient of 
correlation may, as in the case of ungrouped series, be com- 
puted by any of a number of variations of the general formula. 
They are the same as those previously given for ungrouped 
series, but the work is done in terms of deviations, expressed in 
intervals, rather than in either actual scores or deviations 
expressed in original units. In Table XX the computation of r 
by two methods, one involving the finding of the corrections 
and standard deviations, and the other not, is shown. 

The method used at the lower right of the table is based upon 
the fourth formula for r given above. The corrections and 
standard deviations are found, as back in Table X, giving 
Cx = 6.82, c„ = 6.86, o-* = 2.889, and (r„ = 2.807, all in terms 
of intervals. Since Xxy = 2701, 


2xy ^ 2701 
N 50 


54.0200. 


Subtracting from this leaves 7.2348, the numerator of the 
final fraction. 


r = 


7.2348 


2.889 X 2.807 


Two points not essential to this computation appear worth 
mentioning in connection with it. If the means are desired, 
they may be found by the usual formula, M = AssM + d. In 
this case the assumed mean is always the mid-point of the next 
lower class below the lowest one in the table. Therefore, 

Mx = 110. -I- 6.82 X 20. = 246.4 
and 


My = 72.5 -f- 6.86 X 5 = 106.8. 


Secondly, if the standard deviations are to be reported as such, 
they should be in terms of original or score units, not of intervals 
as they are employed in the calculation. That is, 

<r. = 2.889 X 20. = 57.78 


and 


<ry = 2.807 X 5. = 14.035. 
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The other of the two methods of finding the coefificient of 
correlation illustrated in Table XX employs the second of the 
formulas vised when the c's and <r’s are not needed. Substituting 
the proper totals in this formula gives 


2701 - 


341 X 343 




= + .89, 


as by the other method. 

It is true here, just as for Tables XV, XVI, and XVII, that 
the numerator and denominator of the fraction yielded by the 
second method are each N, in this instance 50, times those of 
the fraction resulting from the first method. Also, in both 
places, each <t is the square root of the corresponding quantity 
in parenthesis divided by N. 

Table XXI contains the same data and the computation of 
their coefficient of correlation by the same two formulas used 
in Table XX, but with assumed means taken near the true 
means. Since some of the steps are the same as those for 
means and standard deviations, and all are the same as those 
in Table XX, they are not explained here. Both formulas 
yield the same final fraction and value of r as in the previous 
table. 

A number of so-called correlation charts, really work sheets, 
intended to assist those who are computing coefficients from 
correlation tables, have been published. They regularize pro- 
cedure by providing a specific place for each step, often with 
explanation right at hand, but the writer docs not believe that 
they offer much advantage over the use of cross-section paper 
accompanied by the habit of placing each step in the same 
position. 

Grouping data to be correlated sacrifices accuracy for con- 
venience, just as any grouping does. It tends to yield lower 
coefficients than would be obtained from the exact data. If 
the series correlated are not very dissimilar in shape and if the 
number of classes in each is ten or greater, the inaccuracy is 
generally small. A study by the writer indicated that with 
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TABLE 

Computation of Coefficient of Correlation of Data in 


Final 


Total Score on 


Exam. 

Score 

120- 

140- 

160- 

180- 

200- 

220- 

240- 

260- 

280- 

130- 










125- 










120- 









1 

115- 






1 

1 

1 

2 

110- 







1 

1 


105- 




2 


1 

2 

3 

1 

100- 





2 

2 

2 

2 


95- 



1 

2 

2 

2 




90- 



2 

1 

1 





85- 










80- 










75- 

2 

1 

1 







fx 

2 

■ 

4 

5 

5 

6 

6 

7 

4 

X 

-6 



■ 

B 

-1 

0 

+ 1 

+2 





H 

B 

-6 

-64 

+7 

+8 


72 

25 

64 

45 

1 20 

6 

0 

■Bi 

16 

Xy 


-6 

-14 

n 




+ 1 

'"'+ 7 ' 

^xy 

1 +72 

+30 

+56 

+21 

+18 

+4 

0 




363 - 


(-9)(-7) 

60 




361.74 

405.53 
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XXI 

Correlation Table With Assumed Means Near True Means 

Short Tests I I I I I 


300 - 

320 - 

340 - 


2 

1 

1 

2 

1 




1 

1 



1 























2 

6 

2 

+3 

+4 

+5 

+6 

+24 

+10 

18 

+6 

96 

'*+ 2 r 

50 

+9 

+ 18 

+84 

+45 


y 

fy 

fy^ 

2 x 

Xxy 

+5 

+ 15 

75 

+ 13 

+65 

+4 

+ 16 

64 

+ 16 

+64 

+3 

+3 

9 

+2 

+6 

+2 

+ 14 

28 

+ 11 

+22 

+ 1 

+3 

3 

+5 

+5 

0 

+51 

0 

-2 

0 

-1 

-8 ' 

8 

-4 

- 1-4 

-2 

-14 

28 

-16 

+32 

-3 

-12 

36 

-13 

+39 

-4 

0 

0 

0 

0 

—5 

0 

0 

0 

0 

-6 

-24 

144 

-21 

+ 126 


-58 

395 

+47 

+363 


-7 


-56 



— = -.18 cy = — == -.14 
50 " 50 

^ = 8.3800 s; = ^ = 7.9000 
50 "so 


d = .0324 
(d = 8.3473 
ff. = 2.889 
363 


d = .0196 
d = 7.8804 
<7. = 2.807 


-50 = 

CxCy — .0252 

p = 7.2348 


2,889 X 2.807 8.1094 


= +.89 
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only five classes in each variable over half the differences are 
less than .05; that with ten classes in each over two-thirds 
are less than .02; and that with twenty classes about three- 
fourths are less than .01. 

A correction for grouping may be obtained by substituting 
the corrected value of each a, as given on page 91, for the 
actually obtained value. The formula given there, for a 
expressed in intervals, was 



Emplo3dng this for the data in Table XXI gives 
= \/^889^- .083.3 = 2.875 

*corr 

and 

= V2.807=* - .08;i3 = 2.792, 

whence 

^ 7.2348 ^ , on 

2.875 X 2.792 ’ 

an increase of only .01 from -f-.89, the uncorrected value. 

Computing intercorrelations among a number of variables 

The various intercorrelations among a number of variables 
are often needed. These must be based upon the same cases 
in each; hence their intercorrelations involve repeated use of 
the same totals, means, corrections, standard deviations, and 
so forth. Only the products of the pairs of variables are unique 
to each correlation. Therefore anyone who is computing such 
intercorrelations should so plan his work as to avoid unneces- 
sary duplication. Table XXII illustrates how this may be 
done for ungrouped series. The first four columns, headed A, 
E, G, and 7, contain the school marks of 20 pupils in algebra, 
English, general science, and industrial arts. They were given 
in terms of a five-letter marking system, but have been trans- 
muted into numbers from 5 to 1. The next four columns 
contain their squares; and the last six, the various products. 
These are then totalled and the means, standard deviations. 



TABLE XXII 

Computation of Intercorrelations Among a Number of 
Series of Ungrouped Variables 
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numerators of the final fractions, and finally the values of the 
six possible correlation coefficients found by the method first 
employed in Table XV. The resulting coefiicients, given at 
the bottom of Table XXII, show a range in values of from 
+.30 up to +.81. 

Limitations upon the use of coefFicients of correlation 

It has already been stated that coefficients of correlation 
measure only rectilinear relationship, that variable errors in 
the data tend to cause the values computed to be less than 
those derived from perfectly accurate scores, and that they 
never prove causative relationship. In addition, there are 
certain other limitations and cautions that should be heeded. 

One of these has to do with averaging. The ordinary mean 
of two or more correlation coefficients is not at all sure to be 
very close to the value that would be obtained if the data from 
which the two or more are derived were all combined and a 
single coefficient determined. If the means and standard 
deviations of each variable, as it appears in the several cor- 
relations, and also the numbers of cases, are each approximately 
the same, then the mean is probably close to the value for all 
the data combined. Two methods have been suggested as 
better, but since one regularly yields results larger than the 
mean, and the other smaller, it is doubtful which, if either, 
should be employed. Probably the better of the two calls for 
taking the square root of the mean of the squares of the 
coefficients to be averaged. Thus for r’s of +.48, +.52, +.61, 
and +.75, the result is 

^■ 48 ^ + . 52 =‘ + . 61 =^ + . 75 ^ = _|_ 60 

If some of the r’s are positive and some negative, this method 
is not valid. 

What is called spuriotis correlation arises through the effect 
of an irrelevant factor upon the variables being correlated. In 
such cases, the computed coefficient is correct mathematically, 
but liable to misinterpretation because of neglect of the ir- 
relevant factor. For example, if total scores on a test are 
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correlated with those on one of its subtests, the fact that the 
subtest is contained in the total serves to increase the obtained 
coefficient. In general, if a subtest is uncorrelated with the other 
subtests composing the whole test, the coefficient of correlation 

between scores on it and on the whole is ■J-, in which n is the 

number of times as variable the whole is as the subtest. For 
example, the coefficient between total scores and those on a 
subtest one-fourth as variable as the whole test and uncor- 
related with the other three-fourths is Vi = -1-.50. If each 
variable is multiplied or divided by the same factor, and it is 
correlated with them, the obtained coefficient is increased. 
For example, two series of mental ages obtained at an interval 
of time may be divided by chronological ages to secure intelli- 
gence quotients. If the MA’s and C.4’s are positively cor- 
related, the IQ’s will be more closely correlated than were 
the MA’s. 

Another caution is that coefficients of correlation computed 
from groups of unequal variabilities, or ranges of talent, in the 
traits concerned are not directly comparable. The greater the 
variability, the greater will be the obtained value of the 
coefficient, other things being equal. For example, an obtained 
coefficient between chronological age and mental age in a single 
school grade is often rather low, whereas a similar coefficient 
for a whole elementary school of six or more grades is usually 
high. Similarly the correlation between two duplicate forms 
of a test obtained by giving it to a single grade is generally 
much less than that for the same test given to several grades. 
Another way of stating the same fact is that a coefficient ob- 
tained by combining groups with different means is generally 
larger than that in a single one of the groups. 

To render coefficients obtained from different ranges com- 
parable, formulas to estimate what coefficients would be in 
different ranges are available. The most frequently useful is 
that for two series of measures of the same thing, such as scores 
on duplicate forms of the same test. If subscripts 1 and 2 are 
employed to denote the two situations, 

estimated ri = 1 — ^ (1 — J^). 
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To illustrate this, we may assume that a coefficient of correla- 
tion of -f-.60 between the two forms of a test has been found 
in a relatively narrow range and one of -1-.80 for another test in 
a broader range, that the standard deviations are 10 and 16 
points,* respectively, and that we wish to compare them. The 
estimated coefficient that would be obtained for the first test 
in a group with a standard deviation of 16 instead of 10 is found 
to be 

1 - J^*(l - .60) = + .84. 

Since this exceeds +.80, the first test appears to possess greater 
self-correlation than the second. The same conclusion can 
also be reached by estimating the coefficient of the second test 
if it is given to a group with a standard deviation of only 10. 
In this instance the formula gives 

1 - ^'(1 - .80) = + .49, 

which is less than +.60, and therefore indicates that the second 
test has the lower self-correlation. In comparing coefficients 
of reliabiUty of tests, the procedure just described should always 
be followed unless it is known that they have been derived from 
groups with equal variabilities. 

For cases in which the two variables correlated are not 
similar measures of the same thing, that is, in which reliability 
is not involved, a more general formula must be employed. 
The exact one is rather complicated, but a simpler one usually 
yields approximations close enough to be reasonably satisfac- 
tory. It is 

estimated ^ V < “ ’ 

or the same with the subscript y instead of x used with each <r. 
If the formula is employed with both x and y and the results 
averaged, a better estimate than from one alone is secured. 

To illustrate the formula just given, the following measures 

* In a real situation such as this, the standard deviations of the two duplicate 
forms of each test would probably be slightly different. If they are, their average 
should be used in the formula. 
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may be employed, with subscript “1” referring to the smaller 
range and “2” to the greater: 

rx^v^ = + -45, = + .77, = 4.6, ax^ = 6.4, = 6.0, 

and (Tvj = 7.8. 

The question to be answered is which value of r indicates 
closer relationship? 

As is possible for reliability data, so for these the value of 
either coefficient corresponding to the obtained value of the 
other may be estimated. The estimated value for the narrower 
range, estimated from that obtained for the broader by using 
the standard deviations of X, is 

Tx^y^ = -[q V4.6^ - 6.42(1. - .77^) = + .46 

and that by using those of Y is 

Vx^y^ = g ^ ^ 7.8H1. - .77^) = + .56. 

The average of +.46 and +.56 is +.51, which is the best 
estimate of the value corresponding to +.77 when the range is 
reduced. Since this is greater than +.45, the value of r actu- 
ally obtained for the reduced range, it indicates that a closer 
relationship is shown by r = +.77 wheno-* = 6.4 and Cy = 7.8 
than by r = +.45 when <r* = 4.6 and Cy = 6.0. 

The same conclusion is reached by estimating the value of r 
for the broader range. Thus, using the standard deviations 
of A, 

W 2 = U V'6.4^ - 4.6^1. - .45=) = + .77 

and, using those of Y, it equals 

^ \/7.8» - 6.02(1. - . 452 ) = + .73. 

These average +.75, which is less than +.77, and hence sup- 
ports the conclusion stated above. 

Interpreting coefficients of correlation 

. The interpretation of coefficients of correlation in terms of 
how much relationship they indicate is difficult. Because of 
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the close connection of regression equations and certain error 
formulas therewith, it seems well to postpone the major treat- 
ment of the matter until after the discussion of those topics. 
Two or three points will be stated here, however. Several 
writers have suggested interpreting them in terms of adjectives 
such as “negligible,” “low,” “fair,” “good,” “high,” and so 
forth, but the present writer does not believe this practice has 
much to recommend it. There are so many bases of judgment 
and comparison that the subjective element present in such 
terms is quite large. 

Without any implication that either variable causes the 
other, and on the assumption, rarely completely valid, that 
the factors are not correlated with one another and that those 
common to the variables exert the same average effect as those 
not common. 



In this Uc stands for the number of common factors, ni for 
the number in one variable, and for that in the other. 
Thus, if, within the conditions stated, 4 factors are common to 
two variables, one of which has 8 and the other 10 factors, their 


4 

10 


+ .45; 


if 6 are common to two, of which one has only 6 and the other 
12, their 


6 

Vo X 12 


+ .71; 


and similarly for others. 

Sometimes it is helpful to interpret a coefficient in terms of 
the displacement associated with it, that is, of the differences 
in positions of the measures in one series from those of the cor- 
responding measures in the other. Such displacement may be 
stated in at least two ways. One is to divide each series into 
parts, generally equal, such as halves, thirds, fifths, and so on, 
and state to what extent corresponding measures do or do not 
fall in the same parts of the two series. The other is to express 
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the agreement and shift in terms of some measure of variability, 
by stating the fractions of cases whose positions in the two 
series differ by less than, as much as, or more than various 
niunbers of standard, median, mean, or quartile deviations. 

Since the second of the two general methods just mentioned 
is more conuno nly emp loyed than the first, it will be amplified. 
By dividing y/2 — 2r into values of the measure of deviation 
employed .5 less® and ,5 more than the difference in position 
concerned, using the results as entries in the first column of 
such a table of the normal curve as is given in the Appendix, 
finding the corresponding areas between pairs of ordinates at 
the given deviation distances, and finally doubling them, the 
per cents of cases falling within divisions the positions of which 
differ by the given deviation distances may be found. This 
sounds more difficult than it is. Let us assume that we wish 
to express the amounts of displacement in terms of standard de- 
viations. If r = +.60, = .8944; 0 -5- .8944 = .0000 

and .50 -f- .8944 = .5590. Taking these as a values in the first 
column of the standard deviation table in the Appendix, the cor- 
responding areas between ordinates at those distances from the 
mean and the mean are .0000 and .2118. Twice the difference, 
.2118, is .4236, the fraction of cases that, for r = +.60, fall 
within the same division, l.OOff wide, in the two series. To 
find the fraction falling in divisions 1.00a apart, values of <r 
of .50 and 1.50 are used. The area within .50 -t- .8944 has 
already been found to be .2118. 1.50 h- .8944 = 1.6771, and 

the included area corresponding to this is .4531. 2 (.4531 — 
.2118) = .4826, the desired fraction. Similarly it can be found 
that .0886 of the cases fall within divisions 2.00a apart, .0051 
within those 3.00a apart, and less than .0001 within those 
more than 3.00a apart. If we desire to know the fraction 
not more than a certain number of standard deviations apart, 
we merely total those up to and including that distance. 
Thus .4236 + .4826 + .0886 = .9948, which is the fraction 
not more than 2.00a apart when r = .60. 

To enable the reader to apply this method without going 

* In the case of corresponding divisions, rather than those that differ in positions, 
a value of 0 rather than of —.5 should be used. 
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through the computation just illustrated, Table XXIII has been 
prepared. It is in terms of the median or quartile rather than 
the standard deviation and shows, for each of a number of 
values of the coefficient of correlation, the per cents of cases the 
positions of which in one series differ by the given amounts from 
their positions in the other. Thus, the row for r = +.80, for 

TABLE XXIII 

Per Cents of Cases Displaced in Terms of Median or Quartile Deviations 
Corresponding to Given Values of the Coefficient of Correlation 


Median or Quartile Deviations 


r 

0 

1 

2 

3 

4 

5 

6 

7 

+1.00 

100. 








+.99 

98. 

1.7 







+.98 

91. 

9.2 







+.97 

83. 

17. 







+.96 

77. 

23. 

.03 






+.95 

71. 

29. 

.14 






+.90 

55. 

43. 

2.4 

.02 





+.80 

41, 

48. 

10. 

.78 

.02 




+.70 

34, 

47. 

16. 

2.7 

.21 




+.60 

29. 

45. 

20. 

5.1 

.76 

.07 



+.50 

26. 

43. 

22. 

7.4 

1.6 

.22 

.02 


+.40 

24. 

40. 

23. 

9.3 

2.5 

.49 

.07 


+.30 

23. 

38. 

24. 

11. 

3.6 

.80 

.15 

.02 

+.20 

21. 

37, 

24. 

12. 

4.6 

1.3 

.29 

.05 

+.10 

20. 

35. 

24. 

13. 

5.5 

1.8 

.46 

.09 

.00 

19. 

34. 

24. 

14. 

6.3 

2.3 

.67 

.16 


example, shows that for this value 41 per cent of the cases fall 
within corresponding divisions l.OOMdD or Q wide in the two 
series; 48 per cent fall in divisions so wide and l.OOMdD or Q 
apart; 10 per cent in such divisions 2.00MdD or Q apart; 
.78 per cent in such divisions Z.OOMdD or Q apart; and .02 per 
cent in such divisions A.OOMdD or Q apart. Furthermore, by 
adding entries from the left, one can find that 89 per cent fall 
in corresponding divisions not more than l.OOMdD or Q apart; 
99 per cent in divisions not more than 2.00MdD or Q apart; 
and so on. 

A more detailed and specific method of interpreting the 
significance of coefficients of correlation that is especially helpful 
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when prediction is involved requires the use of tables too 
elaborate to be included here. Such tables may be given in 
various forms. Jackson and Phillips^ have prepared two sets 
of tables which will be found quite helpful for this purpose. 
Those of the first group show the theoretical per cent of each 
tenth of one variable associated with each tenth of the other 
for various values of the coefficient. Those of the second 
contain the per cents of success and failure by tenths for failure 
ratios of 20, 30, 40, 50, 60, 70, and 80 per cent for the same 
values of r. Both are based upon normal distributions of the 
two variables. 

If, as is often necessary or at least convenient, a coefficient of 
correlation obtained from a small number of cases is to be taken 
as the most probable value for a large number of similar cases, 
of which the small number is a sample, a corrected value is 
likely to be more accurate than the actually obtained one. It 
may be obtained by the fornxula, 

r„rr = -Jl - (1 - r=) (^— !)• 

For example, if r from 20 cases is +.50, the most probable value 
for the total population is 

V' - » - “’>(§^1) - + 


The effect is always, as here, to reduce the obtained value. 

A probably better but more difficult formula for the same 
purpose is 

r(l-r2)/ 1 - 5r2 \ 

Tcon r - 1) V 4(iV - 1)/ 


For the same data this gives 


^corr — ‘SO 


.50(1 - .502) / 
2(20 - 1 ) \ 


1 - 5 X .502\ ^ 
4 ( 2 0 - 1) / 


.49, 


also smaller than uncorrected r but not so much so. 


* Jackson, Robert W. B., and Phillips, Alexander J., Prediction Efficiencies by 
Deciles for Various Degree of Relationship, Educational Research Series No. 11. 
Toronto: Department of Educational Research, Ontario College of Education, 
University of Toronto, 1945. 18 pp. 
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EXERCISES AND PROBLEMS 


1. Compute the coefficients of correlation of the following paired 
series: 


(a) 

(b) 


(c) 

(d) 

(e) 



(f) 

36 

38 

215 

17 

59 

61 

2 

15 

105 

1 

8 

1250 

46 

49 

180 

14 

46 

73 

7 

24 

96 

4 

11 

1350 

96 

91 

169 

9 

55 

72 

3 

11 

121 

0 

4 

1100 

17 

22 

202 

11 

37 

81 

4 

17 

104 

1 

10 

1400 

94 

95 

197 

12 

32 

79 

6 

12 

89 

6 

6 

1200 

62 

74 

223 

15 

53 

64 

11 

22 

102 

3 

5 

1150 

31 

28 

186 

13 

47 

72 

7 

13 

109 

2 

24 

1450 

60 

59 

174 

10 

42 

68 

8 

18 

93 

5 

2 

1050 

90 

93 

200 

10 

54 

66 

9 

23 

81 

9 

6 

1300 

32 

27 

210 

12 

61 

54 

5 

14 

106 

4 

7 

1250 

58 

66 

196 

8 

47 

75 

4 

25 

112 

1 

13 

1400 

78 

83 

179 

7 

42 

80 

6 

19 

98 

5 

6 

1250 

44 

45 

188 

9 

51 

70 

10 

18 

90 

6 

7 

1200 

28 

32 

193 

11 

60 

61 

8 

12 

85 

7 

3 

1100 

72 

71 

205 

14 

52 

74 

3 

29 

123 

1 

1 

1000 

80 

84 

190 

13 

39 

83 

7 

20 

108 

2 

2 

1050 

86 

87 

177 

12 

45 

74 

6 

27 

87 

8 

9 

1350 

50 

54 

201 

11 

48 

63 

6 

17 

102 

3 

21 

1400 

79 

82 

209 

15 

53 

68 

12 

16 

99 

4 

7 

1200 

65 

62 

186 

9 

43 

79 

7 

17 

114 

0 

9 

1250 



194 

11 

38 

44 

8 

14 

103 

2 

3 

1100 



172 

8 

52 

67 

4 

12 

91 

5 







40 

78 

6 

14 

86 

6 







53 

68 



136 

0 







36 

75 








2. Tabulate each of the following sets of data in a correlation 
table: 

(а) 83-89, 89-76, 72-83, 83-96, 89-74, 89-82, 83-96, 81-68, 82-74, 89-82, 

75- 88, 83-74, 66-75, 77-84, 86-91, 95-88, 90-80, 85-73, 8^84, 78-76, 
68-67, 96-85, 85-89, 94-71, 81-76, 88-85, 86-93, 73-70, 72-84, 82-82, 

76- 81, 71-84, 79-80, 68-72, 93-87, 83-85, 78-76, 74-83, 85-82, 81-89, 
90-92, 91-87, 73-76, 75-82, 91-95, 80-84, 77-83, 84-87, 80-88, 77-76, 
74-72, 84-88, 69-68, 76-74, 83-86, 89-89. 

(б) 16-10, 12-11, 10-34, 11-12, 18-16, 11-11, 10-12, 13-13, 10-10 18-41, 
15-26, 14-12, 18-39, 20-35, 18-36, 12-11, 10-24, 13-34, 19-33, 12-16, 
21-24, 10-16, 10-21, 14-22, 11-27, 19-36, 17-24, 12-13, 13-18, 12-22, 
14-13, 14-21, 15-30, 10-25, 11-14, 15-33, 13-32, 23-42, 16-18, 14-29, 
13-25, 18-21, 15-21, 12-31, 10-17, 16-32, 12-17, 16-36, 14-11, 15-20, 
11-15, 18-33, 12-23, 17-21, 15-21, 16-37. 

(c) 220-1800, 70-2200, 35-2000, 200-2500, 33-2000, 279-3150, 18-1485, 
135-2700, 26-1675, 80-2000, 43-1800, 17-1300, 68-2400, 39-1600, 
112-2000, 63-1500, 40-2400, 197-3500, 30-1480, 23-2300, 42-1800, 
52-2450, 175-3250, 182-2500, 70-2000, 243-3500, 62-2400, 113-1500, 
140-3150, 168-2500, 180-2900, 75-1500, 30-2100, 95-2200, 90-2200, 
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37-2000, 40-2350, 244-3900, 62-2400, 52-2400, 17-1500, 140-2500, 
170-3400, 135-2400, 70-2000, 158-3000, 16-1250, 118-2310, 12-1170, 
74-3000, 267-3100, 23-1900, 27-2200, 135-2200, 165-4000, 90-2700, 
84-2250, 18-1575, 49-2200, 27-1620, 86-2900, 160-2000, 250-3200, 
111-2100, 54r-1950. 

(d) 3-119, 25-402, 1-18, 10-220, 4-92, 6-91, 2-24, 4-44, 6-116, 9-165, 
10-154, 4-52, 8-119, 24-365, 6-86, 17-306, 2-33, 6-71, 1-30, 2-20, 

6- 73, 2-38, 6-66, 7-87, 9-155, 6-90, 8-112, 5-60, 6-121, 4-97, 4-64, 

7- 114, 6-115, 10-153,5-105, 12-255, 11-291, 7-77, 6-150, 4-48, 8-199, 
6-69, 5-100, 9-145, 11-205, 16-270,6-144, 4-46, 8-118, 4-88, 1-15, 
9-116, 5-120, 6-85, 10-179, 6-96, 3-39, 5-80, 15-253, 3-63, 6-108, 
5-78, 6-123. 


3 . Compute the coefficient of correlation for each of the correla- 
tion tables made for the data in Exercise 2. 

4 . Compute the intercorrelations of each of the following sets of 
data: 


(a) ( 6 ) 


X 

Y 

Z 

1 

2 

S 

A 

64 

196 

42 

. 5 

14 

58 

118 

52 

174 

34 

9 

27 

79 

135 

87 

214 

65 

6 

11 

64 

114 

79 

218 

58 

1 

6 

38 

96 

71 

185 

44 

7 

15 

82 

105 

80 

195 

64 

12 

29 

95 

148 

66 

164 

48 

4 

6 

74 

104 

55 

152 

37 

8 

12 

83 

121 

73 

178 

52 

6 

15 

65 

99 

84 

290 

67 

3 

10 

43 

88 

58 

165 

39 

9 

16 

92 

110 

73 

188 

53 

2 

4 

51 

93 

77 

191 

47 

10 

25 

96 

124 

82 

194 

62 

8 

12 

88 

106 

68 

183 

49 

7 

13 

62 

108 

80 

201 

56 

4 

7 

43 

93 

71 

182 

38 

6 

10 

60 

108 

69 

176 

45 

8 

15 

65 

117 

67 

171 

46 

5 

9 

40 

99 

62 

164 

43 

3 

7 

43 

94 


6 . Which situation, of each pair stated below, indicates the closer 
relationship? 

(а) r = -f.75, <r = 5.5, or r = -f 92, a = 8.0. 

(б) r = 4*.40, a\s = 4.0 and 3.8, or r = +.85, (r’s = 6.4 and 6.6. 

(c) r = +.54, as = 3.1 and 3.2, or r = +.70, <r’s = 4.5 and 4.7. 

(d) +.62, fx^v^ = +.81, (Txj = 3.2, ar^ ~ 4.2, ay^ = 12.5, ay^ — 

16.5 

(e) = +.34, rr^u^ = +.96, (Txj = 4.4, a^^ = 7.8, ay^ = 7.3, ay^ = 
13.6. 



136 


THE COEFFICIENT OF CORRELATION 


6 . What is the coefficient of correlation in each of the following 
situations, assuming equal and uncorrelated factors? 

(а) 3 common factors, 5 in one variable, 6 in other. 

(б) 4 common factors, 5 in one variable, 12 in other. 

(c) 8 common factors, 10 in one variable, 14 in other. 

7 . What is the coefficient of correlation between the total test 
score and that on a part of the test uncorrelated with the other parts 
if the part is half as variable as the whole test? What if it is one- 
sixth as variable? 

8 . Find the most probable value of the coefficient of correlation 
for the total population for each of the following obtained coefficients 
and sizes of samples: 

(а) r = +.65, iV = 25 (c) r = +.48, iV = 42 

(б) r = +.86, AT = 30 (d) r = +.74, AT = 28 
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CHAPTER VIII 


Regression and the Interpretation of 
the Coefficient of Correlation 


The meaning and function of regression 

It frequently occurs in educational and other work that it is 
desirable, even essential, to estimate or predict the score on 
one variable that is most likely to be associated with a given 
score on another variable with which the first is correlated. 
When this is done, there is a’ tendency for the predicted score 
to be relatively nearer the mean of its series than the score from 
which it was predicted is to the mean of its series. This tend- 
ency is called regression. The equation employed in making 
the prediction is the regression equation, and the graphic repre- 
sentation of this equation is the regression curve or curve of 
regression. 

Just as correlation may be either rectilinear or curvilinear, 
so may regression. It also resembles correlation in that the 
one most frequent — indeed, almost the only — method em- 
ployed is rectilinear. If the relationship between the variables 
is curvilinear, rectilinear regression does not enable us to make 
the best estimates, but because the use of curvilinear regression 
is much more difficult and because in many cases where regres- 
sion is employed the relationship is at least approximately 
rectilinear, rectilinear regression is used almost to the exclusion 
of curvilinear. 

The regression coefRcIenfs and equations 

For every pair of correlated variables, there are two regres- 
sion equations, one for estimating each variable from the other. 

137 
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Since they require only the means, standard deviations, and 
coefficient of correlation, they are easily found. There are 
several variations of form in which they may be expressed, the 
chief difference being that either the scores themselves or their 
deviations from means may be employed. The first is usually 
more convenient. Using X and Y to designate the two vari- 
ables, the regression equation for estimating X when Y is 
known is 


X = r^Y + M^-r^ My 

ffy Cy 

and that for Y when X is known is 

Y = r~ ’'X + My - M^. 

<Sl Ot 

The term is known as the regression coefficient and is symbol- 
ized by b. Hence 


bx = r— and by 



The regression coefficient shows the slope of the regression line, 
as the rectilinear regression curve is called, or the average 
amount of change in one variable associated with a change of 

one unit in the other. Often b is substituted for r- in the 

<r 

formulas above, which then become 


and 


X = bxY + Mx- bxMy 


Y = byX + My- byMx. 


The data in Tables XV, XVI, and XVII may be employed 
to illustrate the computation of regression equations. For 
them, Mx — 104.25, My = 78.80, or* = 14.83, ay = 9.98, and 
r = -1-.84. Substituting in the two regression equations gives 

X =. 84 lt|r + 104.25 -. 84M;| 78.80 




and 
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By performing the indicated multiplication and division and 
combining the last two terms, these reduce to X = 1.25F + 5.9 
and Y = .57X + 19.9. Thus, if X = 110, for example, the 
value of Y most likely to be associated with it is 

.57 X 110. + 19.9 = 83; 

if F = 60, the most likely value of X associated with it is 
1.25 X 60. + 5.9 = 81; 

and similarly for other cases. In these equations, the regres- 
sion coefficient of X on F — that is, 5* — ^is 1.25 and that of F 
on X — that is, — is .57. 

In order to make the meaning of regression coefficients and 
equations more real, Figure 19 has been inserted. It is a cor- 



relation graph of the data just employed to compute the regres- 
sion equations, with the individual cases and the regression 
lines shown. The line that is lower at the left and higher at 
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the right represents the regression of X on Y, that is, the equa- 
tion for estimating X when Y is known, and the one that is 
higher at the left and lower at the right that of Y on X. It 
will be observed that the two lines intersect at the means. 

The horizontal distance, parallel to the X-axis, of each 
point from the X on F line represents the difference between 
the estimated and the actual value of each X case, and the 
vertical distance, parallel to the F-axis, from the F on X line 
represents the difference between the estimated and actual 
value of each F case. For example, the equation for F gives 
.57 X 135. -f- 19.9 = 97 as the most likely value of F asso- 
ciated with X = 135, whereas it actually is 92. The difference, 
5, is what is known as the error of estimate. The lower the 
correlation, the larger the errors; and the greater, either 
positive or negative the correlation, the smaller the errors; 
until when r = d=1.00, all the cases lie on the line of regression 
and there are no errors. The size of errors in scores estimated 
by regression equations will be discussed more fully in a later 
section. 

As a second illustration of the computation of regression 
equations, the data in Tables XVIII, XIX, XX, and XXI 
will be used. For them, M, = 246.4, = 106.8, c* = 57.78, 

<Ty = 14.04, and r = -1- .89. Therefore, 

KM = 

and 

and the regression equations are 

X = 3.66F + 246.4 - 3.66 X 106.8 

and 

F = .22X + 106.8 - .22 X 246.4, 

which reduce to X = 3.66F — 144.8 and F = .22X -|- 53.5. 

Regression coefficients, similarly to correlation coefficients, 
may be computed without actually finding the means, standard 
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deviations, and coefficient of correlation. The formulas, in 
terms of actual scores, are 


sxy 


6. = 


N 


SF2 _ 


(sy)2 

N 


and 


by-- 


ZXY - 


sx= 


(2X)’‘ 


N 


and in terms of deviations the same with x and y instead of 
X and y, respectively. 

For the benefit of readers familiar with the method of least 
squares, it seems well to mention that the regression equations 
are the same as those of the lines of best fit employed in that 
method. They may be found by the solution of simultaneous 
equations involving terms that correspond to b and the constant 
that is added or subtracted. 

Sometimes scores are expressed in terms of their standard 
deviation distances from the mean. In other words, such a 


X — M 

score equals - ^ , in which X represents the obtained score. 

It is variously called standard, sigma, and z-score. When scores 
are so expressed, the standard deviation of their distribution 
becomes 1.00. Therefore /3 (beta), the symbol employed for 


the regression coefficient of standard scores, equals 7’^ or just r. 


Using this in the regression equations gives, for such scores, 


Zx = rzy and Zy = rz^. 

The terms dependent variable and independent variable often 
occur in connection with regression. The dependent variable is 
the one estimated or predicted from the other, whereas the 
latter is known as the independent variable. 
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The Interpretation of coefficients of correlation and 
regression equations 

Generally the most helpful method of stating the degree of 
relationship indicated by a given value of the coefficient of 
correlation — ^which is another way of sasdng the amount of 
error present in scores estimated by a regression equation — is 
by means of the coefficients of alienation and of depend ability . 
The coefficient of alienation, S 5 Tnbolized by k, equals \/l — r*. 
The coefficient of depe ndability, symbolized by Em, equals 
1 — or 1 — Vl — r*. Sometimes the latter is multiplied by 
100 and called the index of 'prediction', that is, 

7, = 100 - lOOfc or 100 - 100 

To interpret k and Em, one should think of a scale of the 
error involved in estimated scores extending from no error at 
all — ^the condition when r = ±1.00 — down to the amount of 
error involved when all are simply estimated at the mean — a 
condition accompanying which r = .00. The coefficient of 
alienation, k, shows what fraction the amount of error con- 
nected with a given value of r is of the amount when all are 
estimated at the mean. The coefficient of dependability, Em, 
shows the fractional reduction in amount of error for the value 
of r with which it is associated. The second and third columns 
of Table XXIV give the values of k and Em for each of a number 
of values of r. For example, if r = ±.95, the amount of error 
in scores estimated by the regression equation is .3122 as great 
as if all were taken at the mean and, conversely, .6878 better 
than if all were so taken. 

The most evident fact revealed by this table is that, for high 
values of r, small decreases therein are accompanied by large 
increases in k and large decreases in Em', whereas, for small 
values of r, large increases are accompanied by small changes 
in k and Em- Many persons are accustomed to think of values 
of r in the .90’s as indicating quite close relationship, whereas 


^ A more accurate value is given by 

but the formula is rarely employed. 
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TABLE XXIV 

Values op Em g , and Eg Corresponding 
TO Given Values of r. 


r 

k 

E^ 

9 

E, 

±1.00 



.0000 

1.0000 

±.99 

.1411 

.8589 

i .0998 

.9002 

±.98 




.8593 

±.97 

.2431 

.7569 

.1719 

1 .8281 

±.96 

.2800 


.1980 


±.95 

.3122 

.6878 


.7792 

±.94 

.3412 

.6588 

.2413 

.7587 

±.93 

.3676 

.6324 


.7401 

±.92 

.3919 

.6081 

.2771 

.7229 

±.91 

.4146 

.5854 



±.90 

.4359 

.5641 


.6918 

±.866 

.5000 


.3536 

.6464 

±.85 

.5268 

.4732 

.3725 

.6275 

±.80 

.6000 


.4243 

.5757 

±.75 

.6614 

.3386 

.4677 

.5323 

±.70 

.7141 

.2859 



±.65 

.7599 




±.60 

.8000 


.5657 

.4343 

±.55 

.8352 

.1648 



±.50 

.8660 



.3877 

±.40 

.9165 


.6481 

.3519 

±.30 

.9539 


.6745 

.3255 

±.20 

.9798 




±.10 

.9950 



.2964 

.00 

1.0000 


1 .7071 

.2929 


the table shows that this is far from true. To illustrate this 
graphically, Figure 20 has been inserted. 

The entries in the column headed k may also be employed 
somewhat differently. The common method of describing the 
size of the errors 0 / estimate — that is, the differences between 
scores estimated by regression equations and actual scores — is 
to state a measure of their variability. Of such measures the 
standard error of estimate, which is merely the standard devia- 
tion of the errors, and the probable error of estimate, which is the 
median of the errors, are the two most commonly employed. 
For the former, the usual symbol is cri.2 or Oesn but sometimes 
C1.2 (epsilon) or €«« is employed, ^ and, for the latter, PE1.2 or 


* The use of c instead of cr is preferable, in order to avoid confusion with a as 
the standard deviation, but such use is rare. 
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PEm- Since the formula for € 1.2 is o-^Vl. — r* or aik, the entries 
in the second colunm of Table XXIV may be employed in 
computing ei. 2 . Likewise, since 

PEi.i = either .6745ei.2 or MdDJs, 
the entries may be used for that also. 

VALUE OF 



.00 .20 .40 .60 .80 1.00 

VALUE OF r 

Fig. SO. Values of E„ and k Corresponding to 
Given Values of r 

For the data in Tables XV, XVI, and XVII, already em- 
ployed for correlation and regression, 

e,.» = 14.83 \/l. - .84’ = 8.05 

and 

€,.* = 9.98 Vl. - .84= = 5.42. 

In other words, if the regression equation for X, which is 
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X = 1.257 + 5.9, is used to estimate its values, 68.27 per cent 
of them will be in error by less than 8.05, and so on. 

PE^., = .6745€,.», 

which is .6745 X 8.05 or 5.43; hence 50 per cent of the errors 
will be less than 5.43; and so on. Readers should note that 
the first sjnnbol in the subscript of an error of estimate denotes 
the variable being estimated and the second denotes the one 
from which it is estimated. 

The standard and probable errors of estimate, being standard 
and median deviations, may be interpreted as any other such 
deviations, following the discussion in Chapter VI. That is, 
68.27 per cent of the erroi's of estimate are less than their 
standard error, 95.45 per cent are less than twice their standard 
error, and so on, for any value of it or of the probable error, 
according to the tables in the Appendix. In this, as in all 
similar cases, these interpretations are based upon the assump- 
tion of normal distribution. 

If secured from a small sample and taken as appl 3 dng to the 
total population, the standard or probable error of estimate, 
just as the coefficient of correlation, should be corrected. The 
correction is merely the multiplication of the obtained value by 



Thus, for the 20 cases for which €x.» was found to be 8.05, its 
corrected value is given by 



The last two columns of Table XXIV may be used in another 
method of stating the amount of error or reduction of error 
associated with values of the coefficient of correlation. It has 
not been frequently employed, but is sometimes confused with 
the other. This method differs from that associated with k 
and Em in that the point of maximum error is taken as being 
the situation resulting when the scores in two series of variables 
are matched by pure chance or guess, such as by drawing from 
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boxes or hats. The symbol g is used to represent the guessing 
or error element; E,, for that of reduction in error or certainty. 
For example, if r = ±.75, the errors in scores estimated by 
the regression equation are .4677 as large as they would be in 
pure guesses, or the reduction in error is .5323 over that. The 


formula for g is merely 



or .7071k, and that for E, is 


1 - or 1 - .7071fc or .7071F„ + .2929. 

Another method of interpretation involves r* and A:*, known 
as the coefficient of determination and of nondetermination, 
respectively. This method applies only if one variable is 
associated with all of another plus one or more other factors. 
In such a situation the coefficient of determination, that is, r*, 
represents the fraction of the variance in the first variable 
associated with the second, and the coefficient of nondetermina^ 
tion, k^, represents the fraction of the variance in the first 
associated with other factors than the second.* Thus if, in the 
situation from which the data employed in Tables XV, XVI, 
XVII, and elsewhere were obtained, all of X plus other factors 
are associated with Y, .84* or 71 per cent of the factors in Y 
are associated with X and 29 per cent are not. 

The coefficients of correlation and alienation and of deter- 
mination and nondetermination are closely connected with the 
standard deviations and variances of the variables. If is 

the standard deviation of the portion of Y associated with X 
and <r„, the standard deviation of the portion of Y not associated 
withX, 





$, and fc* = 1*. 
<r* cl 


If one variable — ^in this case, Y — ^is associated with only a 
part of another variable — ^here X — ^plus other factors, the deter- 
mination of the first by the second is the square of their 
coefficient of correlation divided by the square of the coefiScient 
between the common factor or factors and the second variable. 


• Readers should recall that r* -f A:* * 1.00. 
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That is, using c to denote the common factor, the determination 
of F by Z is 

lij! 

lex 

and the nondetermination is 



Estimated true scores 

It is sometimes desirable to secure the best estimates of 
what scores would be if they were perfectly reliable. The 
formula for estimating true scores is 

Z; = rX + (1 - r)M or r(Z - M) + M, 

in which X'„ stands for the estimated true score,^ r for the 
coefficient of reliability — that is, of correlation between dupli- 
cate forms or two administrations of the test — and X for the 
actually obtained or fallible score. 

To illustrate the application of this formula, we may assume 
that for a certain test the mean score is 66 and the coefficient 
of reliability is +.88. Then the estimated true score for a 
fallible score of 80 is 

.88 X 80. + (1 - .88)66. = 78.; 
that for one cf 58 is 

.88 X 58. + (1 - .88)66. = 59.; 

and so one for others. By performing the indicated operations 
the last term becomes 7.92; hence for this situation the formula 
for any estimated true score becomes .88Z’ + 7.92. The reader 
should note that an estimated true score is always nearer the 
mean than is the fallible score from which it is obtained. 

EXERCISES AND PROBLEMS 

1. Compute the regression equations of the data in Exercises 1 
and 2 on pages 134 and 135. 

2. In Exercise 1, Part (a), on page 134, what is the estimated value 

^The prime (') regularly signifies an estimated rather than an actual score and 
the subscript <» (infinity) refers to a true measure. 
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of the second variable corresponding to each of the following values 
of the first? 40, 56, 81, 95. 

3. In Exercise 2, Part (6), on page 134, what is the estimated value 
of the first variable corresponding to each of the following values of 
the second? 14, 18, 26, 30. 

4. Compute the values of r^, fc, Emy g, and Eg for each part 
of Exercises 1 and 2 on pages 134 and 135. 

6. Find the standard and probable errors of estimate for each 
variable for the data in Exercises 1 and 2 on pages 134 and 135. 

6. Find the corrected values of the standard and probable errors 
of estimate found in Exercise 5 if they are to be taken as applying to 
a total population. 

7. If the first variable is considered to be completely associated 
with the second, find the standard deviation of that part of the 
second associated with the first, and of that part of the second not 
associated with the first, for each part of Exercises 1 and 2 on pages 134 
and 135. 

8. If the common factor in the variables has a coefficient of cor- 
relation of .70 with one of them, find the determination and non- 
determination of the other by it for the data in each part of Exercises 
1 and 2 on pages 134 and 135. 

9. Find the equation for estimated true scores in each series for 
Parts(a) of Exercises 1 and 2 on page 134. 

10. Apply the equations found in Exercise 9 to estimate the true 
scores for fallible scores of 45 and 78 in the first series in Part (a), 
Exercise 1, and for fallible scores of 76 and 92 in the second series in 
Part (a). Exercise 2. 
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CHAPTER IX 


The Reliability of Scores 


The coefRcIent of reliability 

It has already been stated that the coefficient of correlation 
between two administrations of the same test, or two forms 
thereof, is called the coefficient of reliability. Other synonymous 
terms are coefficient of self -correlation and retest coefficient. The 
definition just given may be broadened to include any two series 
of scores or measures of the same individuals obtained by the 
same measuring instrument or procedure. In terms of its 
significance, the coefficient of reliability is the fraction of the 
test score that measures whatever it is that the test measures, 
the remaining part being due to variable errors. 

To distinguish the coefficient of reliability from other 
applications of the coefficient of correlation, its symbol often 
has a subscript consisting of the Arabic and Roman forms of 
the same figure. If a single test is in question, it is regularly 
7-11 ; for a second test it is ran; and so on. 

Although the coefficient of reliability is the most frequently 
employed measure of the reliability of a test or a set of scores, 
it is far from satisfactory. The chief reason for its weakness 
is the fact, already stated in Chapter VII, that it is so largely 
affected by the range of scores. One should not judge, there- 
fore, the relative reliability of measuring instruments by their 
coefficients of reliability unless he knows that the ranges of 
scores upon which they are based are at least approximately 
equal or that they have been made comparable by the procedure 
given in Chapter VII. Later in this chapter other measures 
of test or score reliability will be presented. 
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The Spearman-Brown formula 

An important factor in reliability is length. It is a matter 
of common observation that a large sample of anything usually 
yields more accurate or reliable information concerning the 
whole than does a small sample. The same condition holds in 
testing. A test of 40 elements, for example, ordinarily results 
in more reliable scores than does one of 20 elements. The 
Spearman-Brown formula, sometimes called the prophecy for- 
mula, deals with the relationship between length of measuring 
instrument and reliability. It is based upon several assump- 
tions, as follows: that the material in the instruments is 
homogeneous, that is, similar in type of content, difficulty, 
form, time per clement, and other pertinent characteristics; 
that the conditions of testing are the same; and that fatigue 
does not enter in sufficiently to affect performance on the 
longer test. When these conditions are met, the formula gives 
results in close agreement with empirical data. It is 


" i ■+ {n - l)rn’ 

in which n refers to the number of times a test is lengthened or 
the ratio of length of the longer to the shorter and r„ to the 
coefficient of reliability of the longer test. 

As an example of the use of this formula, if a test of 20 ele- 
ments for which rn = -f-.50 is lengthened to have 60 elements, 
that is, to be three times as long, and the conditions stated 
above are met, the coefficient of reliability of the lengthened 
test may be expected to be close to 

3 X .50 
1 -I- (3 - 1).50 


= + .75. 


As a second example, we may determine whether a test of 30 
elements with r^ = -f.45 would be more or less reliable than 
one of 100 elements with ru = -1-.72 if it were made the same 
length. Since 100 30 = d'^i, the formula gives 

3H X .45 


^^n(here 3 1/3) 


1 + (3M - l)-45 


= + .73, 


which indicates that it would be slightly more reliable. 
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The most frequent application of this formula is in deter- 
mining the coefficient of reliability of measuring instruments 
which have only one form or for which the administration of 
two forms is impracticable. The procedure is to compute 
two scores for each individual, usually one from the odd- 
numbered and one from the even-numbered elements. The 
coefficient of correlation between these two series of scores is 
the coefficient of reliability of half the test. To get that of 
the whole test, the Spearman-Brown formula, with n = 2, is 
employed. This is 

2rii 

1 + (2 - l)rii 

which reduces to 


2r,i 

1 + n{ 

Thus, if the coefficient of reliability of half a test is -f-.65, that 
of the whole test is 


2 X .65 
1.65 


+ .79. 


Since the conditions surrounding one giving are practically 
identical for scores on odds and on evens, whereas those for 
two administrations are liable to differ, coefficients of reliability 
secured thus are likely to be slightly higher than those from 
two administrations of either the same or duplicate forms. 
There are also other reasons why the two types of coefficients 
are not exactly comparable, but they are commonly used as if 
they were. 

Since the Spearman-Brown formula applies to scores on 
series of homogeneous tests administered under similar condi- 
tions, the reliability of total scores derived therefrom may be 
estimated by it. The coefficients of reliability of the tests 
composing the total will vary, but the average coefficient of 
those of a given length may be used for ru. For example, if 
the average coefficient of 20 short tests of the same length is 
-t-.40, that of the total score may be expected to approximate 


20 X .40 
1 + (20 - 1).40 


-I- .93. 
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It is sometimes useful to reverse the formula, to find how 
much to increase length to secure a desired higher reliability 
coefficient. For example, if a test of 50 elements has a co- 
efficient of +.63, how many elements are necessary to yield a 
coefficient of +.90? Solving the formula for n gives 


= ^’nC l - rii) , 
rii(l - r„) 


Substituting in this gives 


n 


.90(1 - .63) _ ,2 y . 
.63’(1 - .90) 


5^7 X 50 = 264 elements, the number needed. The same 
formula may be used in reducing length and reliability. Thus, 
if a test of 300 elements has a coefficient of +.97 and one of 
+.95 is sufficient, 

n = = -5876 and .5876 X 300 = 176, 

the needed number. 

The Spearman-Brown formula also applies approximately in 
other situations, such as those involving the number of judges 
or raters and of suggested responses in multiple-answer tests. 
If elements in such a test are of approximately equal difficulty, 
or if scores are weighted according to difficulty, the relationship 
between reliability and number of responses follows this 
formula. 


Attenuation and its correction 

The presence of variable errors in correlated measures tends 
to reduce obtained coefficients of correlation below w’hat they 
would be if no such errors were present. This reduction in 
coefficients of correlation is called attenuation. By means of the 
formula to be given it can be eliminated, and the value which a 
coefficient would have if both series were perfectly reliable can 
be estimated. In most instances there is no good reason for 
doing so, but in some experimental and other research studies, 
particularly some of those that endeS'^^or to determine causa- 
tion, it is helpful to do so. 
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The general form of the formula to be used to correct for 
attenuation involves dividing a measure of the correlation be- 
tween the two series by a measure of their reliability. In order 
that the latter be available, there must be two series of measures 
of each variable. Therefore, there are, in such cases, four series 
in all. The two series of measures of one variable may be sym- 
bolized by 1 and I, those of the other by 2 and II. Four cor- 
relation coefficients between the two variables are possible, 
since 1 may be correlated with both 2 and II, and 2 with both 
1 and I also. The two coefficients of reliability are, of course, 
1 with I and 2 with II. When more than one coefficient is 
employed in either numerator or denominator of the formula, 
the geometric mean is used as being the best average thereof. 

The best form of the formula contains all possible coefficients; 
thus, 

= ^ ^i2^inri2riii 

Since almost as good results are secured by using only two 
coefficients, 1 with II and l with 2, in the niunerator, the usual 
form employed is 

/ riiiri2 

"V riir2ii 


It is possible to use a single coefficient, without the radical sign, 
in the numerator, but this is not recommended. 

To illustrate correction for attenuation by the usual formula, 
let us suppose that the coefficient of 1 with II is +.56; of I 
with 2, +.61; the reliability coefficient of 1 with I, +.78; and 
of 2 with II, +.91. Then, 


^12.. 


V 


.56 X .61 
.78 X .91 


+ .69, 


the estimated coefficient if both variables were perfectly reliable. 

Inspection of the formula makes it evident that the higher 
the reliabilities of the two series of scores, the less the increase 
in the coefficient between them when the effect of unreliability 
is removed, and vice versa. If they are perfectly reliable, there 
is no increase at all. 
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The index of reliability 

The ind£x of reliability is the coefficient of correlation between 
the scores actually obtained and the theoretical true scores. 
It is the square root of the coefficient of reliability, that is, 
therefore, except when both are .00 or 1.00, it is always 
larger than the coefficient. The index really is a better measure 
of reliability than the coefficient, since agreement with true 
scores is a better criterion of consistency than is agreement 
with another series of fallible scores, but it has never come into 
common use. Since the index and coefficient are so closely 
and unvaryingly related to each other, both lead to the same 
conclusions in the comparison of measuring instruments as to 
their reliability. Likewise the index is affected by the range of 
talent or variability of scores; hence, it is subject to the same 
limitations of interpretation in this respect as is the coefficient. 

Errors of measurement 

It has already been stated that other measures of the reliabil- 
ity of scores are preferable to the coefficient. The error of 
measurement, and its ratios to the mean and standard devia- 
tion, are the measures meant. An error of measurement^ also 
called error of a score and error of observation^ is the difference 
between an actual and a true score. In other words, it is the 
amount by which an obtained score differs from what it would 
be if it were perfectly accurate. Another definition of an 
error of measurement is that it is the error of estimate involved 
with an actual score and a perfectly reliable true score. 

Errors of measurement, just as errors of estimate and others, 
are generally measured by their standard and median devia- 
tions, called the standard error of measurement and the probable 
error of measurement. The usual formula^ for the first is 

^ Several other formulas are available. One that has much merit but is seldom 
used is 



in which d is the difference between scores on equivalent halves. 
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ei.« or 0 - 1 . <0 or or = aVl — 
and for the second 

PEi.a or PEmtat = .6745<rv"l — ru or AfdDVl — rn. 

These formulas are obtained from those for the standard and 
probable errors of estimate by substituting the index of reliabil- 
ity for the coefficient of reliability employed therein. In each 
case the coefficient used is that of the correlation between the 
two series of scores involved. Thus, 

= <rVl - 

which equals crVl — ru. Since reliability usually involves two 
forms of a test and their standard or median deviations are 
likely to differ slightly, the -average value is regularly taken. 
Hence, the formula for the standard error of measurement 
becomes 

and that for the probable error 

or 

MdT}\ MdDi t- 

2 VI -r„- 

Thus if a test has a coefficient of reliability of +.84 and the 
standard deviations of its two forms are 10.2 and 10.4, its 

10.2+10.4,/:; ^ ,, 

ei.® = 2 VI — .84 = 4.1 

and its 

Pi/i.® = .6745 X 4.1 = 2.8. 

The standard and probable errors of measurement are inter- 
preted as are any standard and median deviations and con- 
stitute satisfactory measures of the absolute sizes of test errors 
due to unreliability. Because they are absolute rather than 
relative, however, they are not valid for comparing tests unless 
similar scores thereon happen to have the same significance. 
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Just as an error of a foot is generally a serious one in the 
measurement of the length of a small room but not in the 
measurement of the distance from one town to another, so an 
error of 5 points is much more serious on a test of 25 elements 
than on one of 250 elements. To secure relative measures of 
error, absolute measures thereof must be compared with 
measures of the opportunity for error. 

Two measures of opportunity for error have been commonly 
employed for the purpose just stated. They are the mean and 
the standard deviation. Either the standard or the probable 
error of measurement may be compared with them by simple 
division, it being used as the dividend and the mean or the 
standard deviation as the divisor. The probable error is 
employed in this manner more frequently than is the standard 
error. If two forms of a test with different means and standard 
deviations are concerned, the average values are used. There- 
fore, the usual forms of the formulas for the ratios are 


which reduces to 


and 


.6745 


Ml + Mr ‘ 


2 


.6745 


{(Tl + 0’i)Vl — r,! 

Ml + Ml ’ 


which becomes 


.6745 


2 

CFi + CTi 


ni 


.6745V1 - m. 


If the means of the two forms of the test mentioned on the 
previous page are 68.3 and 69.1, the ratios of its probable error 
of measurement to the mean and standard deviation are 


2.8 

68.3 + 69.1 


= . 04 , 


and 


2 
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i0^4 0'-««6Vi-:r8j-.27, 

2 

Each of these two relative measures has its merits and de- 
merits, hence its adherents and opponents. The mean, and 
therefore the first ratio, can have its value materially changed 
merely by adding a number of very easy elements to those 
already in a test, by removing from a test its easiest elements, 
and in various other ways that have little or no effect upon the 
real reliability of the scores that result from it, in other words, 
upon the size of the errors. Another way of stating the same 
point is that, since most tests do not have true zero points, 
their true means are not so exactly determined as appears on 
the surface. On the other hand, the second ratio depends 
directly upon the coefficient of reliability, hence upon the range 
of talent, which we have seen has a disturbing effect. The 
writer believes that it is advisable to make use of both of the 
suggested ratios, but to give somewhat more weight to the 
first, that is, the one based upon the mean, than to the second. 
If they agree in indicating that one test is more reliable than 
another, that conclusion may be accepted with a high degree 
of confidence. 


EXERCISES AND PROBLEMS 

1. Compute the coefficients of reliability, the indices of reliability, 
the standard and probable errors of measurement, and the ratios of 
the latter to the means and standard deviations for the data in 
Parts (a) of Exercises 1 and 2 on page 134. 

2. Find the coefficients of reliability of tests with original coeffi- 
cients and changes in lengths as follows: 



(a) 

(h) 

(c) 

(d) 

(e) 

rii 

-f.66 

+.35 

+.96 

+.77 

+.81 

Times as long 

2. 

8. 

.5 

3. 

2.5 

3. How many times as long must tests with the coefficients of 
reliability given below be made to have the stated desired coefficients? 


(a) 

(fc) 

(c) 

(rf) 

(e) 

Actual coefficient 

+.60 

+.86 

+.98 

+.42 

+.73 

Desired coefficient 

+.90 

+.97 

+.95 

+.87 

4.92 
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4 . Find the coefficient of correlation corrected for attenuation in 
each of the following situations: 

(a) riu = +.38, ri 2 = +.42, ru = +.85, r 2 ii = +.67. 

(b) ri2 — +.45, m = +.92, r2ii == +.86. 

(c) ri 2 = +.63, Till = +.65, ri 2 == +.58, = +.66, ru = +.82, r 2 ii = +.84. 

(d) rin = +.79, ri2 = +.81, ru = +.89, r2ii =+1.00. 
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CHAPTER X 


Rank Correlation 

Nature and function 

Rank correlation has been suggested as an easier means of 
measuring and expressing the relationship between two vari- 
ables than is product-moment correlation. It is a method, or 
rather includes two methods, that takes account only of ranks 
or relative positions of scores, not of their exact values. Since 
it can be used only with measures in columns, not with those 
in correlation tables, since its computation rapidly becomes 
more laborious as the number of cases increases, since it is 
less reliable than product-moment correlation, also less valid 
for data approximating normality and of various other shapes, 
it is rarely employed with more than thirty or forty cases. 
In some instances involving larger numbers of cases only ranks 
are available; hence product-moment correlation cannot be 
found, and so rank correlation may be employed. If there are 
tied ranks in one or both series, the resulting value of rank 
correlation is too large. It can be, but rarely is, corrected by 
a little-known formula. 

In general, rank correlation may be considered an easy, 
approximate method to be employed when the number of cases 
is so small that the slight difference in reliability between it and 
product-moment correlation is so much less than the unreliabil- 
ity due to the small number of cases as to be negligible. If the 
shape of a distribution is rectangular or approximately so, rank 
correlation is generally the better measure of the relationship 
existing. 

Assigning ranks 

The first step in computing a coefficient of rank correlation 
is to assign ranks to the measures in the two series. To illus- 
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trate how this is done, a set of twenty-five scores, given in 
Table XXV, will be used. They are arranged in order of size, 
a convenient but not necessary procedure. Insofar as the 
calculation of rank correlation is concerned, it is inunaterial 
whether rank 1 is given to the highest or to the lowest score, 


TABLE XXV 

Assignment of Ranks to Scores 


Score 

Rank 

Score 

Rank 

Score 

1 Rank 

97 

25 

86 

16 

78 

8.5 

95 

24 

86 

16 

74 

7 

94 

22.5 

86 

16 

73 

6 

94 

22.5 

85 

14 

71 

5 

91 

21 

83 

13 

70 

4 

90 

20 

82 

12 

65 

3 

89 

19 

81 

11 

62 

2 

88 

18 

79 

10 

57 

1 



78 

8.5 




but it is consistent with percentile and other -He ranks to give 
it to the lowest, so that practice is followed here. 

The ranks assigned to the given seores are shown in the 
second of each pair of columns in the table. The lowest score, 
57, receives rank 1 ; the next, 62, rank 2 ; and so on up to 74, 
which receives rank 7. The next two scores are both 78. In 
cases of such tied or equal scores the average of the ranks that 
would be assigned them if they were not tied is given to each. 
Here their ranks, if unequal, would be 8 and 9; hence 8.5, the 
average of 8 and 9, is given each. The next score, 79, receives 
10, and so on until 85 is rank 14. The next three are tied at 
86; hence each receives the average of 15, 16, and 17, which is 
16. Another tie, of two cases, occurs at 94; hence each is 
ranked 22.5. Finally the highest is giv'en rank 25, in other 
words, rank N. 


Computing coefficients of rank correlation 

The easier but less reliable and in general less valid of the 
two methods is often called the foot-rule method. Its formula is 



A* - r 
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in which R is the symbol for the coefficient of rank correlation 
and g for the gains or positive differences in rank from one series 
to the other. The better but more difficult method employs 
the formula 

_ 6SD* 

" ~ Nim - 1 )’ 


in which p (rho) is the coefficient of rank correlation and D the 
difference in rank, whether positive or negative. The values 
obtained for R range from 4-1.00 down to —.50 if the number 
of cases is odd and to slightly more than that, depending on 
the number of cases, if it is even. Those for p range from 
+1.00 down to —1.00 and generally are close to those of 
product-moment r for the same data. As will be explained 
more fully in the next section, both R and p may be transmuted 
into approximately equivalent values of r. 

Table XXVI presents the computation of both rank coeffi- 
cients for the same data already employed for column product- 
moment correlation and elsewhere. The first two columns 
contain the scores themselves and the next two, headed R^ and 
Rg, their ranks assigned according to the procedure explained 
in the last paragraph. The column headed D contains the 
differences in rank, found by subtracting each entry in the R^ 
column from the corresponding one iii the Rx column. The 
sum of the positive differences, or gains, is Sgr, here 28. It is 
advisable, although not necessary, to find the sum of the 
negative differences, or losses, also, as a check. Here Si is —28, 
numerically equal to 'Sg, as it always should be. Finally 28 
is substituted in the formula, giving 

« - 1 - Iar?i - + -sra. 


The last column of the table contains the squares of the 
entries in the D column. Since all become positive through 
squaring, they are totalled into a single sum, 249. Using this 
in the second of the two formulas given above, we have 


6 X 249. 
20 ( 20 * - 1 ) 


+ . 813 . 
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TABLE XXVI 

Computation of Coefficients of Rank Correlation 


X 

Y 

It. 

! 

I I) 

IP 

135 

92 

20 

18 

+2. 

4, 

131 

97 

19 

20 

-1. 

1. 

126 

95 

18 

19 

-1. 

1. 

119 

77 

17 

9 

+8. 

64. 

115 

84 

16 

15 

+ 1. 

1. 

114 

89 

15 

17 

-2. 

4. 

108 

85 

14 

16 

-2. 

4. 

107 

78 

13 

10 

+3. 

9. 

104 

83 

11.5 

14 

-2.5 

6.25 

104 

71 

11.5 

4 

+7.5 

56.25 

101 

81 

10 

13 

-3. 

9. 

99 

76 

9 

8 

+ 1. 

1. 

98 

80 

8 

12 

-4. 

16. 

96 

72 ! 

7 

5 

+2. 

4. 

92 

75 

5.5 

7 

-1.5 

2.25 

92 

70 

5.5 

3 

+2.5 

6.25 

89 

79 

4 

• 

-7. 

49. 

88 

73 

3 

6 

-3. 1 

9. 

86 

55 

2 

1 

+ 1. 

1. 

81 

64 


2 

1 -1. 

1. 




i 

+2S,0 = Xq 
-2s.n = 

249.00 - 2D2 

! 


ffi = 1 - ; = +.579 or 1 -.01504 X 28. = +.579 (equivalent r = +.81) 

20^ — 1 

V 940 

p = 1 - - ^ = +.813 or 1 - .000752 X 249. = +.813 (equivalmt 

20(20-’ - 1) r = +.83) 


The computation of both R and p may be facilitated by the 

0 0 

use of a table containing values of and for 

various values of N. With such values at hand the worker 
merely multiplies the correct one by Sgr, if R is to be found, and 
by SD®, if p is wanted, and subtracts the product from 1. 
Table XXVII presents these values for iV ranging from 10 to 
40, inclusive. Thus, as shown at the bottom of Table XXVI, 
R for the data therein may also be obtained by finding the entry 
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TABLE XXVII 
6 6 

Values of r and Corresponding to Values 

iV* — 1 N{N^ — 1) 

OF N FROM 10 TO 40 


N 

6 

6 

N 

6 


N 

6 j 

6 

AT* - 1 

iV(iV* - 1) 

V* - 1 

N(N^ - 1) 

- 1 

N(N^ - 1) 

10 

.06061 

.006061 

20 

.01504 

.000752 

30 

.00667 

.000222 

11 

.05000 

.004545 

21 

.01364 

.000649 

31 

.00625 

.000202 

12 

.04196 

.003497 

22 1 

.01242 

.000565 

32 

.00587 

.000183 

13 

.03571 

.002747 

23 i 

.01136 

.000494 

33 

.00551 

.000167 

14 

.03077 

.002198 

24 

.01043 

.000435 

34 

.00519 

.000153 

15 

.02679 

.001786 

25 

.00962 

.000385 

35 

.00490 

.000140 

16 

.02353 

.001471 

26 

.00889 

.000342 

36 

.00463 

.000129 

17 

.02083 

.001225 

27 

.00824 

.000305 

37 

.00439 

.000119 

18 

.01858 

.001032 

28 

.00766 

.000274 

38 

.00416 

.000109 

19 

.01667 

.000877 

29 

.00714 

.000246 

39 

.00395 

.000101 







40 1 

.00375 

.000094 


0 

for Z 1 when N = 20 in Table XXVII and preceding as just 
stated. Since this entry is .01504, 

B = 1 - .01504 X 28. = + .579. 


Similarly, 


P = 1 - .000752 X 249. = + .813. 


Changing R and p Into r 

As was mentioned earlier, means are available for transmu.t- 
ing R and p into r. The results are not usually exactly the 
same as the directly computed value of r for the same data, but 
approximate it. The discrepancy is due to the fa(5t that the 
transmutation formulas are based upon an assumption as to 
the shape of the distribution of scores that is rarely fulfilled. 
For the two series of ranks p is just the same as r, but since the 
exact data rarely have the same rectangular distribution as do 
their ranks, p for the ranks and r for the exact data differ. 
The formulas used are trigonometric, being 

r = 2cos|(l - fi) - 1 


and 
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A • TT 

r == 2 sm ^ p. 
o 

Since the use of these formulas is somewhat laborious, the 
values of R and p that correspond to values of r at intervals 
of .01 are presented in Table XXVIII. The first column con- 

TABLE XXVIII 


Values of r Corresponding to Values op R and p 


r* 

Posi- 

tive 

R 

Nega- 

tive 

R 

Posi- 

tive 

or 

Nega- 

tive 

P 

r 

Posi- 

tive 

R 

Nega- 

tive 

R 

Posi- 

tive 

or 

Nega- 

tive 

p 

r 

i 

Posi- 

tive 

R 

Nega- 

tive 

R 

Posi- 

tive 

or 

Nega- 

tive 

p 

.00 

.000 

.000 

.000 

.34 

.201 

.179 

.326 

.68 

.453 

.347 

.663 

.01 1 

.006 

.006 

.010 

.35 

.208 

.184 

.336 

.69 

.461 

.351 

.673 

*.02 

.011 

.011 

.019 

.36 

.214 

.189 

.346 

.70 

.470 

.356 

.683 

.03 

.017 

.017 

.029 

.37 

.221 

.194 

.355 

.71 

.480 

.361 

.693 

.04 

.022 

.022 

.038 

.38 

.227. 

.199 

.365 

.72 

.489 

.366 

.703 

.05 

.028 

.027 

.048 

.39 

.234 

.204 

.375 

.73 

.498 

.371 

.714 

.06 

.034 

.033 

.057 

.40 

.240 

.209 

.385 

.74 

.508 

.375 

.724 

.07 

.039 

.038 

.067 

.41 

.247 

.214 

.394 

.75 

.517 

.380 

.734 

.08 

.045 

.044 

.076 

,42 

.254 

.219 

.404 

.76 

.527 

.385 

.744 

.09 

.050 

,049 

,086 

.43 

.261 

.224 

.414 

.77 

.538 

.390 

.755 

.10 

.056 

.055 

.096 

.44 

.268 

.229 

.424 

.78 

.548 

.395 

.765 

.11 

.062 

.060 

.105 

.45 

.274 

.234 

.433 

.79 

.559 

.400 

.776 

.12 

.068 

.065 

.115 

.46 

.281 

.239 

.443 

.80 

.569 

.404 

.786 

.13 

.074 

.070 

.124 

.47 

.288 

.244 

.453 

.81 

.580 

.409 

.796 

.14 

.079 

.076 

.134 

.48 

.296 

.249 

.463 

.82 

.592 

.414 

.807 

.15 

.085 

,081 

.143 

.49 

.303 

.254 

.473 

.83 

.603 

.419 

.817 

.16 

.091 

.086 

.153 

.50 

.310 

.259 

.483 

.84 

.615 

.423 

.828 

.17 

.097 

.091 

.163 

.51 

.317 

.264 

.492 

.85 

.628 

.428 

.838 

.18 

.103 

.096 

.172 

.52 

.324 

.269 

.502 

.86 

.641 

.433 

.849 

.19 

.109 

.102 

.182 

.53 

.332 

.274 

.512 

.87 

.654 

.438 

.859 

.20 

.115 

.107 

.191 

.54 

.339 

.279 

.522 

.88 

.667 

.443 

.870 

.21 

.121 

.112 

.201 

.55 

.347 

.283 

.532 

.89 

.682 

.448 

.881 

.22 

.127 

.117 

.211 

.56 

.354 

.288 

.542 

.90 

.697 

.452 

.892 

.23 

.133 

.123 

.220 

.57 

.362 

.293 

,552 

.91 

.713 

.457 1 

.902 

.24 

.139 

.128 

.230 

.58 

.370 

.298 

.562 

.92 

.729 

.462 

.913 

.25 

.145 

.133 

.239 

.59 

.378 

.303 

.572 

.93 

.747 

.467 

.924 

.26 

.151 

.138 

.249 

.60 

.385 

.308 

.582 

.94 

.766 

.471 

.934 

.27 

.157 

.143 

.259 

.61 

.393 

.313 

.592 

.95 

.786 

.476 

.945 

.28 

.163 

.148 

.268 

.62 

.402 

.318 

.602 

.96 

.809 

.481 

.956 

.29 

.169 

.154 

.278 

.63 

.410 

.322 

.612 

.97 

.834 

.486 

.967 

.30 

.176 

.159 

.288 

.64 

.418 

.327 

.622 

.98 

.865 

.490 

.978 

.31 

.182 

.164 

.297 

.65 

.427 

.332 

.632 

.99 

.905 

.495 1 

.989 

.32 

.188 

.169 

.307 

.66 

.435 

.337 

.642 

1.00 

1.000 

.500 or 

1.000 

.33 

.195 

.174 

.317 

.67 

1 .444 

.342 

.652 



more ' 



* The sign of r is determined by that of R or p. 
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tains the values of r; the second has the corresponding ones of 
R when it is positive; the third has those of negative jB, in 
which case the minus sign should be prefixed to the correspond- 
ing value of r; and the fourth those of p, for which its sign, 
plus or minus as the case may be, should be prefixed to r. 

From Table XXVIII it appears that for R = +.579, the 
value found in Table XXVI, the corresponding value of r is 
+.81, and that for p = +.813, r = +.83. These are both 
fairly close to +.84, the value of r found directly by the product- 
moment method. 


EXERCISES AND PROBLEMS 

1 . Compute both coefficients of rank correlation for each set of 
data in Exercise 1 on page 134. 

2. Find the value of r corresponding to each of the rank cocffi-; 
cients computed in Exercise 1. 
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CHAPTER XI 


The Correlation and Regression of 
More Than Two Variables 


Partial correlation 

Partial and multiple correlation and regression, that is, cor- 
relation and regression involving more than two variables, is 
too large a topic to be treated at all adequately in an elementary 
text. It seems desirable, Imwever, that the chief functions of 
such procedures be stated and their natures explained briefly. 

The coefficient of partial, or net, correlation is an estimate of 
what the ordinary coefficient of correlation between two vari- 
ables would be if the portion of the total correlation associated 
with one or more other variables were eliminated. In other 
words, it is an estimate of what the coefficient would be if the 
values of the one or more others were held constant. For 
example, the correlation between height and weight of a group 
of children in Grades I-VIII is associated with the fact that 
both height and weight correlate with age ; hence, if this portion 
of the total correlation is removed, that remaining is less. The 
result so obtained is an estimate of the average coefficient that 
would be found between height and weight for a number of 
subgroups, each of which contained children all of the same age. 

Although the technique of partial correlation is frequently 
valuable in investigations and studies concerned with cause, the 
evidence it offers never proves causal relationship, but only 
association. There are also other limitations connected with 
its use, largely because we usually do not know enough about 
the nature of the intercorrelations among the variables con- 
cerned; so much caution should be exercised in its interpreta- 
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tion. It is only when the variable or variables eliminated act 
as a single factor or unified group of factors upon the two 
between which the correlation is desired that the partial co- 
efficient is really a valid estimate of what the ordinary coeffi- 
cient would be if the one or more others were held constant. 
Moreover, since it is based upon product-moment coefficients, 
it measures only rectilinear relationship. 

The symbol for a coefficient of partial correlation is r with a 
subscript of as many symbols as there are variables involved. 
The first two sjnnbols denote the two variables between which 
the partial correlation is found, and the one or more following 
them denote the variable or variables eliminated. A point 
halfway up is used to separate the first two from the other or 
others. Thus rn-z means the coefficient of partial correlation 
between variables 1 and 2 when variable 3 is eliminated; 
ri 3.24 that between variables 1 and 3 when 2 and 4 are eliminated; 
and so on. Coefficients are referred to as being of the order 
represented by the nximber of symbols after the dot. Thus 
ordinary coefficients, which have none, are zero-order coefficients; 
those such as ria.s and r„.„ are first order; those such as rii.zi 
and rjic.BD are second order; and so on. 

The general formula for finding any partial coefficient is in 
terms of coefficients of the next lower order. For any number, 
n, of variables, it is 


_ • . (n-l) ^ln.34- • • • ( n~l)y*2n»34 ■ > » (n-1) 

\/(l — r^i„.34- . •(n-l))(l “ ^2n.34. • • (n-1)) 

The denominator may also be written 

fcln«34- • • (n— l)fc2n»34- • •(n-l)- 

For only three variables it becomes 


or 


ri2.3 = 


ri2 - ri3r23 

V(1 - ,^3)(1 - riz) 


ri2 - rizrzs 


Since the order of the subscripts after the dot, as well as that 
of those before the dot, is not significant, the formula for 
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second-order coefficients may be written in either one of two 
ways, 

ri2.3 — ri4.3r24.3 


or 


ri2.34 or ri2.43 = ^ j 

A^14.3A;24.a 
^12.4 ~ ^1 3»4r23»4 

kiZ.JC2Z,A 

that for third-order coefficients in any one of three, and so on. 
In such cases the use of at least two ways provides a check 
upon the result. Anyone planning to compute many partial 
coefficients should consult some more extensive treatment of 
the topic for suggestions as to convenient and economical 
organization of the work. 

To illustrate the use of the formula, we may assume that a 
coefficient of -1-.82 has been found between height and weight, 
one of -1-.85 between height and age, one of -1-.76 between 
weight and age, and that We wish the partial coefficient be- 
tween height and weight with age eliminated. Using the 
initial letters as subscripts, we have 

.82 - .85 X .76 


^HW-A — 


'^IIW — '^iia'^wa 


V(1 - - rW) V(l '- .852)(r - .760 


- = + 51. 


In other words, .51 is the estimated correlation between height 
and weight when age is constant. It is also possible to find 
the partial coefficient of height with age when weight is elimi- 
nated and of weight with age when height is eliminated. Thus 

.85 - .82 X .76 


and 


Tha-w — 




V(1 - .820(1 - .760 


= + -61 


.76 - .85 X .82 


■"‘W = + .21. 


y/{i - .850(1 .820 

In most cases partial coefficients are smaller than the cor- 
responding zero-order coefficients, but they may be larger, 
especially when one of the coefficients is negative. Thus if 
ri 2 = -b.60, ris = -+-.20, and ra = -+-.80, 

.60 - .20 X .80 


ri2.s = 


V(1 - .200(1 .800 


= + .75, 
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and if ri 2 = +.60, rn = —.20, and th = +.40, 


ri2.8 


. 60 - (-.20 X .40) ^ 

Va- .202)(1 - .402) • ’ 


both larger than +.60. Their range is the same as that of 
zero-order coefficients, from —1.00 to +1.00. 

The reader should not confuse partial correlation with part 
correlation, semipartial correlation, and other varieties which 
have some characteristics in common with it, but differ in 
others. They are much less frequently employed. 


Multiple correlation 

The coefficient of multiple correlation is the coefficient of cor- 
relation between one variable and two or more others combined 
or weighted so as to yield the maximum relationship possible 
from an additive, as contrasted wth a multiplicative, exponen- 
tial, or other, combination of them. It is based upon zero- 
order coefficients, and hence measures rectilinear relationship 
only. It is always positive, ranging from 0 up to +1.00, and 
therefore indicates the amount of relationship without regard 
to direction. 

The formula for a coefficient of multiple correlation can be 
written in terms of zero-order coefficients, but in this form 
becomes very unwieldy for more than three or four variables. 
A simpler form is in terms of partial correlation or partial 
alienation coefficients. In situations involving more than a 
few variables, computation by these methods becomes very 
laborious. A number of improved methods are available. Of 
these, the one commonly called the Doolittle Method is prob- 
ably the best for most workers. Explanation of it, as well as 
other helpfxil suggestions, may be found in some of the refer- 
ences at the end of this chapter. 

The general formula for multiple correlation may be written 

Rum - • .» or Ru23- • -n) — 

''/l — (1 — rfzlfl ri 3 . 2 )(l ri4.23) (1 — rin.23. • -(n-i) 

or 

— ^12^3.2^14.28 Alln.23. • -(n-l). 



MORE THAN TWO VARIABLES 


171 


When only three variables are concerned, that is, when one is 
correlated with the combination of two others, it reduces to 

Ri -23 = ”^1 (1 — ^12) (1 — ^3-2) 

or 

VF— kiiki3-2, 

or, in terms of zero-order coefficients, 

/ri2 4" rjj — 2ri2ri3r23 

Since the order of subscripts after the point is immaterial — 
so that ^1.23 and R1.32 are the same, as likewise are iJi.234, 
f2i-243, i?i.324, f2i-342, f2i.423, and Ri.i32, and so on in increasing 
number for more variables — ^multiple coefficients may be com- 
puted in two or more ways in order to check. Thus for Ri.2z 
or R1.32, which arc the same, we may use 

Vl — (1 — r? 3 )(l — riz-z) 

instead of the form given above. 

To illustrate the computation of a multiple coefficient for 
one variable with two others, the data employed in the preced- 
ing section may be used. It will be recalled that the given 
zero-order coefficients were = +. 82 , = + . 85 , and 

fwA — +. 76 , from which it was found that Thw-a — +. 51 , 
1 'uA‘W = +.61 and r^-A-u = +. 21 . From these, we have 


Rn.wA = Vl - (1 - .820(1 - .6P) 


.822 + .852 _ 2 X .82 X .85 X .76 
1 - .762 


or, using it as Ru-aw, 

Vi - (1 - . 852 ) (1 _“; 5 P) , 

each of which equals . 89 . Similarly, 

Rw-iia = \/ r -”( T ~ 822)(1 - . 212 ) 


or 
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Vl - (1 - .76*) (1 - .51=*) 


or 


and 

or 


or. 


4 


.82* + .76* - 2 X .82 X .76 X .85 _ 
1 - .85* 


.83; 


Ra.hw = Vl - (i - .85*) (1 - .21*) 


Vl - (1 - .76*) (1 - .61*) 


4 


85* + .76* - 2 X .85 X .76 X .82 _ gp. 
1 - .82* 


Each of the multiple coefficients just found is only slightly 
greater than the highest zero-order coefficient of the dependent 
variable with one of the independent ones. In other words, 
the total degree of relationship is increased only slightly by 
adding the influence of the second independent variable. The 
reason for this is that the two independent variables, in each 
case, are rather closely correlated with each other. For the 
same coefficients with the dependent variable, the lower the 
correlation among the independent variables, the higher 
the multiple coefficient. Thus, for example, if = +.60, 
ri 3 = +.60, and r23 = +.80, i2i.23 = .63; but if r 23 drops to 
+.60, El. 23 .67; if J■23 = +.40, Ri-a = >72; if j'23 = +.20, 

Ei .23 = .78; and if r23 = .00, E1.23 = .85. Multiple R can 
never be less than zero-order r. 

Quite often a multiple coefficient found from one set of data 
is considered to apply to one or more comparable sets of data 
or to the total population, of which a sample was used to secure 
the obtained coefficient. In both cases the value obtained is 
larger than the true value for the other situation. Generally 
the difference is so small — ^perhaps .01 or .02 — that it is neg- 
lected; but, if many variables are concerned, the number of 
cases rather small, and the multiple coefficient low, the differ- 
ence — or shrinkage, as it is called — ^may be large enough to be 
significant. Quite similar, but not identical, correction formula 
based upon the number of cases and of variables are available 
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for the two types of situations. That for estimating the 
mxxltiple coefficient for a comparable situation is 


•Kcorr — 


,1 1 - 
(1 or 


1 - 


ji 

N 


(nW- 


— n 
n 


and that for estimating the multiple coefficient for the total 
population is 



in which R is the obtained multiple coefficient, N the number 
of cases, and n the nvunber of variables. To illustrate their 
application, let us take R = .60 for 50 cases and 5 variables. 
For a comparable situation. 


Ra 




50J< .60^ -zJ = ^ 
50-5 


and, for a total population of which the 50 cases constitute a 
sample. 




It is evident that the correction formula is worth applying in 
such cases. If, however, the obtained coefficient is .80, the 
number of cases 100, and that of variables only 4, the formulas 
give, respectively, 

. .79 

\ 100 - 4 

and 


p _ /l - 4 + .802(100 - 1) _ 7Q 

^ 4 - 


both only .01 less than the obtained value of R. 


Partial and multiple regression 

Partial and multiple regression bear the same relationship to 
partial and multiple correlation as does ordinary regression to 
zero-order correlation. Partial or net regression refers to the 
best rectilinear means of estimating a dependent variable from 
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one of two or more independent ones when the relationship 
associated with the other independent one or ones is eliminated. 
Multiple regression refers to the best rectilinear means of 
estimating a dependent variable from a combination of two or 
more independent ones. Whether the equation employed is 
considered as being a partial regression equation or a multiple 
regression equation depends upon the desired emphasis — 
whether it is on the separate terms thereof or on the whole 
equation. In effect, partial regression equation and multiple 
regression equation are synonymous. Such an equation should 
be employed whenever we wish to estimate one variable from 
two or more others with which it is correlated and are content 
to use a rectilinear relationship. 

Such equations are similar to those of zero-order regression 
except for having as many terms as there are variables con- 
cerned and for employing partial correlation coefficients and 
standard deviations instead of ordinary ones. Since they use 
the latter, methods of computing them will be given. 

A partial standard deviation is simply the standard deviation 
of the variable involved when the effects of others included in 
partial correlation are eliminated. Its formula may be given 
in any of several forms, of which the following are the most 
frequently employed: 


ffl’tu- • -n — Vi'N^(l — ^2)(1 — ri3.2)(l — ^14.23) (1 “ r^n-n- ■ -Cb-d)) 


and 


fflkijcii.ijcu.ti kin-za- • -(n-l) , 


fflV^l — Ri.iU- ■ - n • 

For two variables the formula becomes 

(Ti-i = <71 1 — ri2 or < 7 iA:i 2 

and for three it is 


<71.23 = <7lV'(l — r?2)(l — rf3.2), 
ejciikis.i, 

V1.2V1 — ri 3 .j, 

or 

ei‘^1 “ f2i.23> 
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For three or more variables there are increasing numbers of 
variations which may be used, just as with partial coeflEicients. 
Thus, since <ri.23 and o’i.32 are the same, the formula for them 

maybe 

— ?'l 3 )(l — ^12.3) f 
c ikiskii-s, 

or 

ffl.sVl — ri 2.3 

as well as the forms already given. 

The partial regression coefficients are found by a procedure 
analogous to that for zero-order coefficients; hence 


bli-U- ■ - n — Ti2.Zi 

For three variables this becomes 

612.3 = r, 2 . 3 (‘'‘-Y 

\ 0 ' 2 . 3 / 


0 ’l- 34 - 


0 ’ 2 - 34 - 


n 


n 


which may be changed to 

/ oiVl -j5 ,\ 


(n 2 - ru -rn)ffi 

(i ~ ’ 


forms sometimes more convenient to use. 

After partial regression coefficients have been found, they 
may be employed to make partial or multiple regression equa- 
tions by the same procedure followed for ordinary regression 
equations, the only difference being the presence of more than 
one independent variable. Thus in general, if Xi, Xz, X3, and 
so on to X„ are the variables involved, the equation is 

Xi = 612.34 • • -niXl — M2) + 613.21. • -niXs — Ms) -f- 

.... 6ln*23- • • • (n-1) (X„ - M„) + Ml. 


The use of such an equation may be illustrated by data 
secured for predicting college marks from high-school marks and 
other data. The data for three variables — ^high-school per-cent 
mark, intelligence test score, and college per-cent mark — were 
as follows, using the initial letters to designate them: Mu = 84 ., 
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— 8.5, Mj — 54., (Tj — 10.2, Me — 82., <Tc ^ 9.6, Tjjj = -4“ .64, 
Tch = +.55, Tci = +.52. The equation desired is that for 
estimating college marks from high-school marks and intel- 
ligence test scores, that is, 

~ “ M//) + bci*H{I Ml) + Me* 

Since 


r _ __ (-55 — .52 X .64)9.6 

(l-rLVff “ (1 - .642)8.5 


= .4155 


and 


I _ (Tci — _ (.52 — .55 X .64)9.6 

(l-r^w)<fi ~ “( 1 - .^ 2 ) 10.2 ^ 

the equation is 


.2678, 


C = .4155(77 - 84.) + .2678(7 - 54.) + 82., 
which reduces to 


C = A2H + .277 + 32.6. 

Stated verbally, this means that the best prediction of a college 
mark is obtained by adding .42 of high-school mark, .27 of 
intelligence test score, and 32.6. 

In case there is no need to estimate actual scores, but only 
relative ones — a situation that sometimes exists — an easier 
formula and method may be employed. If the dependent 
variable is denoted by 1, the independent ones by ^ and 3, 
the weight to be ^ven variable 3 when 2 is weighted l.(X) is 
given by 

fy = < ^2(2*13 ^ I 2 ‘l' 23 ) 

<^3(^12 — *'13^23) 

In this case, calling college mark 1, high-school mark 2, and 
intelligence test score 3, 

ly = ~ _ _?,:6(i62_— .55 X .64) _ ^ 

(riiXcH - Tc/rttj) 10.2(.55 -T52‘x .645 ~ 

Hence, if high-school mark is weighted 1.00, intelligence score 
is weighted .64. This is, as it must be, the ratio between the 
corresponding weights, .27 and .42, yielded by the regression 
equation. 
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The interpretation of partial and multiple correlation and 
regression is similar to that of zero-order correlation and regres- 
sion. Partial k has already been introduced, and multiple K 
and partial and multiple Em, g, and E, may be similarly inter- 
preted. Errors of estimate may also be found in the same way. 
When obtained, all these measures of relationship and error are 
subject to interpretations similar to those given in Chapter 
VIII. 


EXERCISES AND PROBLEMS 

1. Compute the coefficients of partial and multiple correlation, and 
the partial or multiple regression equations, for each of the following 
sets of data: 


(a) 


Algebra 

Arithmetic 

Prognostic 

Mark 

Mark 

Test Score 

83 

92 

71 

79 

83 

62 

91 

96 

76 

85 

83 

64 

88 

93 

66 

72 

76 

53 

59 

65 

42 

86 

88 

70 

77 

83 

61 

75 

75 

62 

95 

94 

81 

82 

87 

77 

68 

73 

48 

74 

82 

60 

87 

93 

74 

71 

72 

57 

85 

86 

69 

90 

89 

73 

78 

84 

63 

84 

86 

65 


(ft) 

.V, = 48, Mt = 29, M, = 74, 

O’! = 7.2, a'2 ~ 5.5, (73 = 8.8, 

ri2 == 4" *48, ri3 = +.75, = +.33. 


(c) 

Ml = 35, M 2 = 76, 3/3 = 57, Mi = 28, 
<71 = 4.2, (72 = 12.6, (73 = 6.6, (74 = 3.9, 
Ti 2 == +-65, ri3 = +-68, rn = +.49, 

T 23 = +-36, r2i = +.44, rs4 = .22. 


2. Compute W, as used in the last section, for Parts (a) and (h) 
of Exercise 1. 


REFERENCES 

Ezekiel, Mordecai, Methods of Correlation Analysis. Second Edition. 
Chaps. X-XVI, XXI-XXII ; pages 450-460, 474^85. New York : 
John Wiley & Sons, Inc., 1941. 



178 


MORE THAN TWO VARIABLES 


Fisher, R. A., Statistical Methods for Research Workers. Eighth Edi- 
tion. Pages 180-185, 252-254. Edinburgh: Oliver and Boyd, 

1941. 

Garrett, Henry E., Statistics in Psychology and Education. Second 
Edition, Chap. XIV. New York: Longmans, Green & Com- 
pany, Inc., 1937. 

Guilford, J. P., Fundamental Statistics in Psychology and Education. 
Chap. XIII. New York: McGraw-Hill Book Company, Inc., 

1942. 

Kenney, John P., Mathematics of Statistics. Part II, Chap. V. New 
York: D. Van Nostrand Company, Inc., 1939. 

Lindquist, E. F., Statistical Analysis in Educational Research. Pages 
248-257. Boston: Houghton Mifflin Company, 1940. 

Mode, Elmer B., The Elements of Statistics. Pages 312-314. New 
York: Prentice-Hall, Inc., 1941. 

Monroe, Walter S., and Engelhart, Max D., The Scientific Study of 
Educational Problems. Pages 377-382. New York: The Mac- 
millan Company, 1926. 

Peters, Charles C., and Van Voorhis, Walter R., Statistical Procedures 
and Their Mathematical Bases. Chap. VIII. New York: Mc- 
Graw-Hill Book Company, Inc., 1940. 



CHAPTER XII 


Other Methods of Correlation 


Ratios of correlation 


Ratios of correlation, sometimes known as coefficients of cur- 
vilinear correlation, are the most commonly employed measures 
of curvilinear correlation. They are found from double-entry 
tables such as are used for coefficients of correlation and make 
use of most of the same steps plus a few additional ones. For 
each such table with two variables there are two ratios, one 
for each variable on the other. In cases where a variable and 
an attribute are correlated, a ratio may be found for the 
variable on the attribute, but not vice versa. 

A ratio of correlation sununarizes the deviations of the data 
from the curved line that best fits the means of the rows or 
columns, according to which ratio it is, just as a coefficient 
summarizes the deviations from the straight line that best fits 
both sets of means. It is alwa 3 '^s positive, ranging from 0 to 
H-l.OO. It cannot be less than the coefficient of correlation for 
the same data. 

The formula for a ratio, abbreviated r; (eta), may be given 
in various forms. When, as usually, the coefficient is also 
desired, the following is probably most convenient : 


Vxv - 



<r* 


and riyx 




Another form is sometimes preferable when the coefficient is not 
wanted, but is so rarely employed that it will not be given here. 

The computation of the ratios for the data employed in 
Table XXI is given in Table XXIX. This table repeats the 
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TABLE 

Computation of the Ratios of 


Final 

Exam. 

Score 

Total Score on 

120- 

140- 

160- 

180- 

200- 

220- 

240- 

260- 

280- 

130- 









— 

125- 








— 









1 

115- 






1 

1 

1 

2 

no- 




1 



1 

1 


105- 

i 




2 


1 

2 

3 

1 

100- 





2 

2 

2 

2 


95- 



1 

2 

2 

2 




90- 

1 


2 

1 

1 





85- 




! 






80- 










75- 

2 

1 

1 







fx 

2 

1 

4 

5 

5 

6 

6 

7 

4 

X 


-5 

-4 

-3 

-2 

-1 

0 

+ 1 

+2 

fx 

-12 

-5 

-16 

-15 

-10 

-6 

-64 


+8 

fa? 

72 

25 

64 

45 

20 

6 

0 


16 



-6 

-14 

-7 

-9 

-4 

+ 1 

+ 1 

+7 

'Lxy 

+72 

+30 

+56 

+21 

+18 

+4 

0 

m 

+14 


144 


■1 


81 

16 

1 

■ 

49 

(sy)« 

fx 



49.00 

9.80 



.17 

.14 

12.26 


341.46 




60 


-.0324 


2.889 


.902 


'330.23 




50 


-.0196 


2.807 


.914 
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XXIX 

Correlation Between Two Variables 
Short Tests 

i — y fy 

300- 320- 340- fy 




fy' 

Xx 


75 

+ 13 

+65 

64 

+ 16 

+64 


+3 

+ 3 

+2 

+ 14 

+ 1 

+ 3 

0 

+51 









2 

6 

2 

+3 

+4 

+5 

+6 

+24 

+ 10 

18 

96 

50 

+6 

+21 

+9 

+ 18 

+84 

+45 

36 

441 

81 

18.00 

73.50 

40.50 


4 


0 


0 -5 


- 8 

8 

-14 

* 

28 

-12 

36 


8-4 +4 16 
28 -16 +32 256 





0 


0 0 



1 

o 

1 

144 

-21 

+ 126 

-.58 

395 

+47 

+363 

■* - 7 


-56 



, = - 7-2348. ^ 332 

2.889 X 2.807 


= - ? .18 c, = -- 

50 ' 50 

= ^ = 8.3800 5* = 

60 50 

ci = .0324 c5 

= 8.3476 cl 

cx — 2.889 

^ = 7.2600 
50 

CxCy — .0252 
p = 7.2348 
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former one, except that one method of finding r is omitted, and 
it also includes the two additional rows and columns needed to 
calculate the ratios and the substitution in the formulas there- 
for. The first added column, headed (Sx)®, contains the squares 
of the entries in the Sx column, and the next contains the quo- 
tients of these squares divided by the corresponding frequencies. 
Thus, since for the top row Sx is 13, the entry in (Sx)® column 
is 13® or 169 and that in the final column is 169 divided by 3, 
the frequency for that row, which gives 56.33. This last column 
is then totalled, the result being 341.46, and this along with 
N, <r„ and substituted in the formula, giving 


Vxg ~ 


341.46 


60 


- .0324 


2.889 


.902. 


Similarly the entries in the two rows at the bottom of the table 
are found, the last one srniuned to yield 330.23, and rjy, found 
to be 


330.23 


50 


- .0196 


2.807 


= .914. 


Since these values are only slightly larger than that of the 
coefficient, -f-.892, it appears that the latter is almost as good 
a measure of the existing relationship as is either ratio. A 
number of suggestions for the interpretation of differences 
between ratios and coefficients have been advanced. The most 
widely known is Blakeman’s criterion of linearity (C of L). It 
is based upon the standard error of ?j® — r®. It has been shown, 
however, that Blakeman’s test may lead to erroneous conclu- 
sions. The use of the test for correctness of fit, which is 
explained in Chapter XIII, appears decidedly preferable. Its 
use for this purpose will be illustrated there. For still better 
understanding of the significance of ratios of correlation, readers 
should employ the tables prepared by Woo.^ 

• Woo, T. L., “Tables for Ascertaining the Significance or Non-significance of 
Association Measured by the Correlation Ratio." Biometrika, Vol. XXI, pages 
1-66. December, 1929. Also in Pearson, Karl, Editor, Tables for Statisticians 
and Biometrieians. Part II, pages xl-xliv, 16-72. Cambridge: Cambridge 
University Press, 1931. 
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The use of the ratio of correlation when one variable and one 
attribute are involved is shown in Table XXX, which contains 
data on the intelligence quotients of boys preferring certain 
types of books. In such cases the attribute may be grouped 
into any number of classes from two up. Here there are six. 
The order of the arrangement of the classes of the attribute is 
immaterial, since it does not affect the result. The computar 
tion is the same as that in Table XXIX, except that a part of 

(Sy)* 


what is there is not found here. Finding c* and Cy, and • 


/* 


for each array, and summing the latter, then using them in the 
formula, we have 


40.23 


Vvx — 


95 


- .0004 


1.61 


= .40. 


Ratios of correlation obtained as described above are subject 
to several errors, the chief of which is connected with the 
number of classes. If there are as many classes as cases, the 
ratio is always 1.00. In any instance, the more classes, 
the larger the error. An approximate correction for this error 
may be made by employing the formula 


Vcorr — 


M 



n - 3 ’ 
N 


in which n is 
subscript of jj. 


and 


the number of classes of the first term in the 
Applying this to the data in Table XXIX gives 




.902* - 


12-3 
'50 


1 - 


12-3 

60 


= .879 


flux 

*^*C0TT 


.914* 


12 


50 


1 - 


12 


.894. 


60 


Similarly, for the data in Table XXX the corrected value 
is .364. 



TABLE XXX 

Computation op the Ratio op Correlation op A 
Variable On An Attribute 
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Although this is the most commonly used correction formula 
for this purpose, another is probably preferable. It is 


'HcarT 


-V 


(A - IW - n + 1 


N -n 

This gives, for Table XXIX, 

/ (50 - 1).9022 -12 + 1 

and 


60 - 12 


= .872 


_ / (50 - 1).914^ _ 12 + 1 _ 

nvw - yj 50 -12 


and, for Table XXX, .336 — ^results not very different from 
those secured by the other formula. 


The bi-serial coefficient of correlation 

Although the ratio of correlation may be found when the 
attribute concerned is grouped dichotomously, that is, into 
only two classes, the common measure of relationship between 
such an attribute and a variable is the bi-serial coefficient of cor- 
relation. This is a rectilinear measure, ranging from —1.00 to 
+ 1.00, roughly comparable with the ordinary coefficient of 
correlation between two variables. It is based on the assump- 
tion that the attribute involved is in reality a continuous and 
normally distributed variable which, however, is grouped 
dichotomously and that its two classes are not extremely un- 
equal in their frequencies, not more so than in the ratio of 
about 9 to 1. If the distribution of the variable is platykurtic, 
a result exceeding ±1.00 may be obtained. 

There are several variations of the formula. Probably the 
easiest to use is 

{Mp - Mn)p 
,S989hx<Ttt 

In this p is the fraction of the cases in the larger of the two 
classes of the attribute, N has its regular meaning as the total 
number of cases, and h is the height of the normal curve at the 
distance from the mean such that p — .5 of the area is included 
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between an ordinate erected thereat and the mean. This 
height is found from a table of the normal curve. 

The computation of a bi-serial coefficient is shown in Table 
XXXI, which contains data comparing passes and failures on 


TABLE XXXI 

Computation of the Bi-Serial Coefficient 
OF Correlation 


Test 

Single Element 


B 

fds 

fd?, 

Score 

Passed 

Failed 

Total 

110- 



3 

+5 

+ 15 

+ 15 

75 

100- 


2 

9 

+4 

+28 

+36 

144 

90- 


1 

13 

+3 

+36 

+39 

117 

80- 


4 

18 

+2 

+28 

+36 

72 

70- 

18 

6 

24 

+ 1 

+ 18 

+24 

24 

60- 

22 

10 

32 

0 

+ 125 

+150 

0 

50- 

15 

12 

27 

-1 

-15 

-27 

27 

40- 

9 

15 

24 

-2 

-18 

-48 

96 

30- 

6 

13 

19 

-3 

-18 

-57 

171 

20- 

2 

7 

9 

-4 

-8 

-36 

144 

Total 

108 

70 

178 


-59 

-168 

870 


+66 -18 


h = height including .607 —.5 of area =.964 


c, = ^ = +.611 M, = 65. +.611 X 10. = 71.11 
108 


cn 


-18 

178 


-.101 Ms = 65. - .101 X 10. = 63.99 


^8TO 
^ 178 

cs 

as ' 
i 


4.8876 

.0102 

4.8774 

2.208 

10 . 


(71.11 - 63.99).607 4.3218 


'w» 


X .964 X 22.08 8.4906 


+.61 


as » 22.08 
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one element of a test with total test scores. Since there are 

more passes than failures, p is the fraction of passes, 

which equals .607. This minus .5 gives .107, the area between 
the mean and an ordinate at the point for which h is found. By 
finding .107 in the third column of either table in the Appendix 
and interpolating to secure the corresponding entry in the 
second column, h is determined to be about .964. By the 
usual method Mp is found to be 71.11, is 63.99, and (Tjv is 
22.08. Substituting these in the formula, we have 

(71.11 - 63.99). 607 
.3989 X .964 X .22.08 


the correlation between scores on the single element and those 
on the whole test. 

The bi-serial coefficient is roughly comparable with the 
product-moment coefficient, but closer comparability may 


.3989/1 


be secured by multiplying the former by 

Vp(l. - p) 

data in Table XXXI this multiplier becomes 


For the 


_^^89 X .964 

v.eWcr- -eo?)’ 


which equals .787. This times -1-.51 is +.40, the improved 
estimate of what r would be for the same data. The same 
result may be obtained without actually calculating the bi- 
serial coefficient by the use of the formula 

r = 

vnVi.-p 


For these data it gives 

(71.11 - 63.99) 

^ 22.08Vi. - .607 


+ .40. 


The bi-serial coefficient has been used frequently for the 
purpose of measuring the validity of single test elements. If, 
as in the example above, score on an element is correlated with 
'that on the whole test, the result is a measure of internal 
validity; if it is correlated with an outside criterion, the result 
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is a measure of external validity. There are numerotis other 
methods of measuring element validity, but this is generally 
considered to be one of the best. 

If, as is usual when element validity is being investigated, all 
that is desired is a comparable index of the validity of elements 
on the same test, a simplified form of the formula may be 
employed. Since .3989 and <Tif are common to the bi-serial 
coefficients of all the elements, they may be omitted, thus 
leaving 

(My - 

h 

Furthermore, if the same assumed means are used, for p and N, 
respectively, throughout, the corrections may be substituted 
for the means, giving 

(C f - Cy)p 

h 

Results from either of these abbreviated formulas are perfectly 
correlated with bi-serial coefficients. For the data in Table 
XXXI, the second gives 

[■611 - (-.lODl.eo? , 

.964 

This is .88 X +-51, the bi-serial coefficient, and all values of 

(cy - cn)v 
h 

for elements on the same test will be 

.88/m, 

for the same elements. 

It has been suggested that simplification may go a step 
farther and h also be dropped. The results tend to arrange 
elements in the same order of validity, but to indicate differ- 
ences in their degrees of validity less accurately as differences 
in p become greater. 

The coefficient of contingency 

The best measure of the correlation of attributes, or of an 
attribute and a variable, when the classification is more than 
two-fold, is the coefficient of contingency, abbreviated C or CC. 
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The formula for this coefficient is given in several forms, 
some rather complex, but may be stated quite simply as 


- 1 


C = 

In this T represents which /„ is the frequency in a 


given cell of the table, /, the total frequency in the row in 
which the cell lies, and /« that in the column. Thus the effect 
of the formula is to compare the number of cases in each cell 
with the number that would fall there by pure chance if each 
row and column had its stated frequency. 

As an example of data for which this method of measuring 
correlation is suitable, we may employ school marks and major 
subjects. Table XXXII contains such data for a marking 
system of five letters and six major fields The formula given 
above, in which 



calls for dividing the square of each entry in Part A, which 
contains the tabulated data, by the product of the total fre- 
quency of its row by that of its column. Thus, for the first cell 
at the upper left, 4* should be divided by 21 X 25; for the 
second cell of the top row, 3* by 21 X 12; and so on. This 
can be done for each cell and results totalled, giving T, but an 
easier method is shown in Part B. In it each squared entry is 
divided by the frequency of its column, these quotients smnmed 
for each row, and each total divided by the frequency of that 
row. Thus, for the first row, 4* divided by 25, 3® divided by 
12, and so on, are summed and their total divided by 21, which 
gives .1673. The same procedure is applied to each row, and 
the results added, their sum being 1.0430. This is T; hence 



1 .0-130 - 1 
1.0430 


= . 20 . 


Instead of summing by rows, we may do the same by col- 
<umns. For this the square of each entry is divided by the 
frequency of its row, these quotients summed by columns, and 
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TABLE XXXII 

Computation op the Coefficient op CoNTmaBNcr 
PART A 


Major Subject 


Mark 

Eng- 

lish 

For. 

Lang. 

' H. E. & 
I. A. 

Math. 

Sci- 

ence 

Soc. 

Stud. 

Total 

A 

4 

3 

6 

2 

3 

3 

21 

B 

8 

4 

13 

6 

8 

6 

45 

C 

7 

4 

10 

7 

8 

5 

41 

D 

5 

1 

4 

5 

3 

3 

21 

E 

1 

0 

1 

1 

2 

1 

6 

Total 

25 

12 

34 

21 

24 

18 

134 


PART B 


JL/16 .1, 36 

4- 15 4- 152 4 - ?? 4 - 51 4 - 
i-^12 4.i54.?524.1?4-5l4.?5^ 

41\25 12 34 21 24 18/ 


4. ± 4_ 15 

21\25 12 34 


4 -? 5 + 9 _ 9 > 

21 24 ISJ 


1 ( 1+1 

6X25 12 


+ 


JL^ 

34 


.1 + 1 + 1 ) 

21 24 18/ 


.1673 

.3388 

.3079 

.1724 

.0566 


1.0430 


^ /1.O43O - 1 ^ : 


1)(6 - 1 ) 
5X6 


each total divided by the column frequency, 
for example, gives 


26\21 45 “*■ 41 21 



The first colunm, 
.1896. 


The sum of these results is T, as before. In this case it is 
1.0429, the slight difference from 1.0430 being due to dropping 
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decimals. It is advisable to sum in both directions, for 
checking. 

The coefficient of contingency may have values from .00 up to 

*j (ni — l )( ri 2 - 1) ^ 

in which ni is the number of classes in one direction and n* that 
in the other. In this example they are 5 and 6; hence C could 
not exceed 

If the obtained value is divided by the upper limit, the result — 
which may be symbolized by Cet ^ — is roughly equivalent to r. 
Here 

Ccorr = '.20 - 5 - .90 = .22. 

In order that the coefficient of contingency and the ratio of 
correlation may be compared directly, the former has been 
computed for the data in Table XXX, for which 17^* = .40. 
For them, C = .52. Conversely, for those in Table XXXII, 
TJjfx ” .17. 

A significant value of C merely indicates that there is some 
relationship between the two series of data, but gives no infor- 
mation as to its direction. Rows or columns of a contingency 
table may be interchanged among themselves without affecting 
the value of the obtained coefficient. 

Product-moment correlation of attributes 

If, in cases involving correlation between an attribute and 
a variable or two attributes, the attribute or attributes con- 
cerned are of such a nature that they may legitimately be 
considered to form a normal distribution, their product- 
moment coefficient of correlation may be found. All that is 
necessary to make this possible is to assign numerical values to 
the classes according to the method explained later, in Chap- 
ter XIII. After this h^s been done, the procedure for comput- 
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ing r is as usual except that care must be exercised to employ 
proper d values. 

Tetrachoric correlation 

The best type of correlation to employ when both variables 
or attributes are dichotomous — that is, divided into only two 
classes — ^is tetrachoric correlation. It is valid only to the degree 
that the variables or attributes would be distributed normally 
if divided into more classes. The number of cases should not 
be quite small, nor should either of the two classes of eithpr 
variable or attribute contain less than .05 of its total frequency. 
The computation of tetrachoric coefficients by means of a set 
of diagrams prepared by Thurstone and others* is relatively 
easy. 

Although the computation of tetrachoric r without the use 
of the diagrams is more difficult, it seems well to illustrate it. 
Part A of Table XXXIII presents data as to the numbers of 


TABLE XXXm 

Computation of the Tetrachoric Coefficient of Correlation 


P>''T A 

PART B 

Ele- 
ment 
No. 2 

Elen.ent No. 1 

14Q 

^ = .596 hi = .971 xi = .243 

1 QO 

— — p;9n — QQft — npin 

R 

W 

T 

W 

67 

63 

130 

250 

.243 X .050 , 63 X 82 - 67 X 38 _ 

R 

82 

38 

120 

2 .1591 X 250.*X .971 X .998 

.006075r»,« + r,e, + .2719 = 0 

T 

149 

101 

250 


_ - 1 ± -s/u - 4 X .006075 X .2719 _ 

2 X .006075 


right and wrong responses on two test elements. It shows that, 
of the 130 individuals who responded incorrectly to element 2, 
67 answered element 1 correctly and 63 incorrectly, and, of the 
120 who had element 2 correct, 82 had correct answers to 

* Chesire, Leona, Saffir, Milton, and Thurstone, L. L., Computing Diagrams for 
the Tetrachoric Correlation Coefficient. Chicago; University of Chicago Book- 
store, 1933. 
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a 

h 

c 

d 


element 1 and 38 incorrect ones. As here, the upper row and 
the column at the left should represent the larger of 
each pair of classes. Part B shows the computa- 
tion. The four cells in the table in Part A are 
thought of as lettered a, h, c, and d, as shown at the 
right. Thus here, a = 67, b = 63, c = 82, and 
d = 38. The first step is to find p-^the fraction of the group 
in the larger class — ^for each of the two things correlated. 

Here pi = = .596 and pi = = .520. Next, the height 

of the normal curve at the point that includes p — .5 of the 
area between it and the mean is found for each. For pi the 
included area is .596 — .5 = .096 and the corresponding height 
is .971, and for pa the area is .520 — .5 = .020 and the height 
.998. Finally the standard deviation distance from the mean 
to the same point — the one including the given area — ^is found 
for each. It is usually symbolized by x. In this case Xi = .243 
and X 2 = .050. 

When these quantities have been found, they are inserted in 
the equation 

be — ad 


TiX2 2 , , 

+ ^tet + 


.\5QlN-hJi2 


= 0 . 


In the example, this gives 

* Tm + r,rt + 


.243 X .050 ,2 


63 X 82 - 67 X 38 


= 0 , 


2 ” ' ' .1.591 X 250.2 X .971 X .998 

which reduces to 

.006075r2, -|- r,« -|- .2719 = 0. 

This may be solved for by any of the standard methods for 
quadratic equations. Perhaps the most convenient in this case 
is that based upon the fact that for the standard form of 
quadratic equation, ax* -I- 6x -j- c = 0, the roots may be ob- 
tained from 

- 6 dz Vfe2 


X = 


4ac 


2a 


Using r,a as x, o as .006075, 6 as 1., and c as .2719, and substitut- 
ing gives 

- 1 ± Vl2 - 4 X .006075 X .2719 




2 X .006075 


= - .27 or - 164. 
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Since the latter is greater than 1.00, it is impossible; hence 

— .27 is the desired value. It is the coefficient between and 
P 2 — that is, between rights on element 1 and wrongs on ele- 
ment 2. Therefore that between rights on the two elements, 
or wrongs on both, is -1-.27. 

Tetrachoric r has the same range of possible value, from 

— 1.00 to +1.00, as product-moment r, and in general may be 
considered as comparable with it and interpreted in the same 
way. 

The phi coefRcient 

For a four-fold table containing data of such nature that 
each series can be divided into only two classes and cannot be 
thought of as forming a normal distribution if more finely 
divided, the 4) {phi) coefficient is available. To compute it 
one should arrange the data as for tetrachoric correlation and 
employ the same four letters, a, 6, c, and d, to represent the 
frequencies in the four cells. Also, as elsewhere, pi is used for 
the larger of the two fractions into which one series of data is 
divided and p 2 for that of the other. The formula is 

_ ad — be 

NWpiO. - pi)p2(l — p*) 

If, instead of numbers of cases in the four cells of a table, frac- 
tions of the total number of cases are used, with a (alpha) = 

j8 (beta) = 7 (gamma) = and 5 (delta) = numer- 

ator becomes a8 — and its denominator loses the term N^. 

To illustrate the computation of <f>, we may assume that the 
data in Table XXXIII, employed for tetrachoric r, are such 
that <f> rather than r« would be appropriate. We had pi = .696 
and p 2 == .520; whence 1 — pi = .404 and 1 — pg = .480. 
Employing these values, the first formula gives 

67 X 38 - 63 X 82 
260V.696 X .404 X .520 X .480 


and the second, 
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.268 X .152 - .252 X .328 
V.596 X .404 X .520 X .480* 

both of which equal —.17. 

The </) coefficient may also be found for tables such as those 
for which a coefficient of contingency is more commonly deter- 
mined, but since the latter is a better measure this is rarely 
done. The two are closely connected, since 

<t> = - ^ and C = 

Vi -C^ \ 1 - 0* 

Also, <j> = VT — 1. For the data in Table XXXII, <!> = 
Vl.0430 - 1. = .21, very close to the value already obtained 
for C, .20. Unless both are .00, <t> exceeds C for the same data, 
the difference becoming larger, the higher their values. 

When secured for data for which it is appropriate, (t> may be 
considered as equivalent to product-moment r. If it has been 
found for continuous variables grouped in two classes each — a 
practice not recommended — dividing it by .637 renders it more 
nearly equivalent to r than it would be otherwise. 


EXERCISES AND PROBLEMS 

1. Compute the ratio of correlation for each part of Exercises 2 
and 3 on pages 134 and 135. 

2. Compute the ratio of correlation for each of the following sets 
of data: 


(a) (b) (c) 


Mark 

Boys 

Girls 

Salary 

H. S. 
Grad. 

Non-H. S. 
Grad. 

Age 

Passed 

Failed 








A 

4 

7 




9-6- 

1 

1 

B 

14 

15 

$2400- 

2 

0 

9-0- 

0 

1 

C 

22 

28 

2200- 

1 

0 

8-6- 

2 

3 

D 

15 

14 

2000- 

4 

1 

&-0- 

1 

2 

E 

3 

6 

1800- 

5 

0 

7-6- 

3 

0 

F 

2 

1 

IGOO- 

11 

6 

7-0- 

7 

3 




1400- 

19 

17 

6-6- 

23 

4 




1200- 

14 

23 

6-0- 

29 

4 




1000- 

8 

18 

5-6- 

4 

0 




800- 

3 

7 

5-0- 

2 

0 




600- 

0 

2 
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3. Compute the bi-serial coefScient of correlation for Parts (a) 
and (b) in Zeroise 2. 

4. Compute the coefficient of contingency for each of the following 
sets of data: 


(a) (6) 


Intel. 

Group 


Place of Birth 


Type 

of 

School 

Years of Training 

Farm 

Small 

Town 

Small 

City 

Large 

City 

1 

2 

3 

4 

5 



Rural 

24 

62 

28 

16 

3 

Sup. 

2 

1 

3 

7 




Town 

Elem. 






Good 

4 

5 

6 

13 

12 

85 

31 

22 

10 

Aver. 

12 

14 

16 

23 

City 

Elem. 

8 

33 

32 



Poor 

10 

16 

14 

11 

04- 

lo 

Town 

High 






Inf. 

3 

6 

5 

8 

0 

0 

6 

38 

16 

City 

High 

0 

0 

1 

30 

49 


(c) 


Best 

Subject 

Intended Vocation 

Prof. 

Semi- 

Prof. 

Sk. 

Tr. 

Unsk. 

Tr. 

Uncer. 

Lang. 

4 

3 

1 

2 

6 

Math. 

11 

14 

5 

3 

5 

Sci. 

9 

13 

10 

4 

3 

Hist. 

6 

4 

2 

0 

7 

Eng. 

5 

4 

2 

2 

2 

I. A. 

4 

6 

12 

3 

3 

H. E. 

1 

3 

7 

2 

6 


5. Correct each coefficient of contingency found in Exercise 4 by 
the formula given in the text. 
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6. Compute the tetrachoric coefficient of correlation for each of 
the sets of ^ta given below. 


(o) 

French 

English 

Passed 

Failed 

Passed 

92 

6 

Failed 

16 

12 


(6) 


Success- 

ful 

Fail- 

ing 

Grad. 

work 

25 

4 

No 

grad. 

work 

45 

10 


(c) 



High 

Low 

Nonparti- 



cipating 

71 

35 

Partici- 



pating 

22 

32 


7. Compute the phi coefficient for each set of data below: 
(a) (6) 



Elementary 

High 

Men 

16 

14 

Women 

95 

31 


(c) 


Latin 

French 

Not 

Taken 

Taken 

Not taken 

147 

73 

Taken 

64 

35 



Non- 

College 

College 

Public 

1436 

387 

Private 

241 

564 
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CHAPTER XIII 


Uses of the Normal Probability Curve 


Determining fraction of cases between two points 

It is frequently useful to be able to determine the fraction of 
cases falling between two points of a normal distribution or in 
each of a number of divisions into which such a distribution 
may be divided. To do the former, all that is necessary is to 
find the standard or median deviation distances of the two 
points from the mean, look, up in the Appendix the area in- 
cluded between each such distance and the mean, and take the 
difference of the areas. For an example, we may refer back to 
the data in Table XI on page 95. Let us assume that we 
wish to know what fraction, or how many, of the cases would 
fall between 82 and 90 if the distribution were normal. The 
mean is 70.25 and the standard deviation 11.47. Therefore 82 


is 


82. - 70.25 
11.47 


= 1.02<7 above the meian; 90 is 


90. - 70.25 
11.47 


1.72(r above it. From the first table in the Appendix we find 
that approximately .346 of the area lies between the mean and 
a point 1.02<r from it, and .457 of it between the mean and a point 
1.72a from it. Therefore the fraction of cases between 82 and 
90, if normality is assumed, is .457 — .346 = .111. Thenmnber 
of cases so located is .111 X 110, the total number in Table XI, 
or approximately 12. 

Since a very common situation in which the areas of parts 
of the normal surface are wanted is the assignment of marks, 
the procedure therefor will be illustrated. There has been 
much argument as to the application of the normal, or any 
other, curve to marking practice and the writer does not wish 
to be understood as recommending rigid adherence to any pre- 
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determined distribution, particularly in small or selected groups. 
He does believe, however, that such distributions have a place 
as at least general guides and that for typical unselected groups 
of pupils the normal is the best one to follow. 

Several decisions must be made before the actual determina- 
tion of the fraction of pupils to receive each mark begins. 
Probably the first is the question of how many marks to 
employ. Any munber may be chosen, but the writer recom- 
mends from five to seven. Five — A, B, C, D, and E — will be 
used here. The next decision is as to whether the normal 
surface is to be divided into portions with equal base lines or 
not. Ordinarily it is assumed that it is so divided; hence that 
assumption will be made in this instance. Finally, since the 
normal surface extends to infinity, a decision must be made as 
to where to cut it off so as to permit equal division of the 
base line. For the purpose of assigning marks, it is usually 
cut off at ±2.5<r or ±3.0<r from the mean. In this case the 
former, ±2.5cr, will be used. 

Figure 21 illustrates a normal curve and its underlying sur- 



Fig. 21 . Division of Normal Surface Into Five Parts 
with Equal Base Lines 


face divided as has just been described. Since it extends from 
— 2.5<r to -l-2.5<r, its total width is fi.Oir; hence each of the 
five divisions has a base line l.Off in length. Therefore the 
points separating the divisions are, from left to right, — 1.5(r, 
— ,5<r, +.5(r, and -fl-fio-, In order to include the whole area, 
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the small portions thereof beyond the extreme points — ^here 
—2.5<r and +2.5<r — are included in the extreme divisions. By 
means of the standard deviation table in the Appendix we find 
that .0668 of the area under the curve is beyond 1.5<t from the 
mean; hence that fraction — or about 7 per cent — of marks 
should be E's and as many A’s. Since .3085 of the area is 
beyond .5<r and this minus .0668 is .2417, that fraction — or 
about 24 per cent — of marks should be D’s and as many B’s. 
Finally, the area from — .5<r to +.5(r is twice .1915 — or .3830 — 
of the whole; so that about 38 per cent of the marks should 
be C’s. The 7,24,38,24,7 distribution thus derived is the 
most commonly recommended and used normal distribution 
of marks. 

If the points at which the normal surface is cut off before 
dividing are closer to the mean, the fractions in the central 
divisions are smaller and those in the extreme ones greater; if 
the points are farther from the mean, the fractions in the cen- 
tral divisions are greater and those in the extreme ones smaller. 
For example, if ±3.0(r instead of ±2.5<r is taken as the limit, the 
per cents of area are approximately 3?, 24, 45, 24, and 31. 

It is not necessary that the base-line distances of the various 
portions of area be equal. The area under the curve, and cor- 
respondingly the per cents of marks, may follow any arbitrary 
pattern of division desired. If a group is of above-average 
ability and is doing work of corresponding quality, a distribu- 
tion with larger per cents of high marks and smaller ones of low 
marks may be adopted, and vice versa. For example, in such 
a case of a superior group the per cents of marks considered 
ideal may be set as follows: E — 5, D — 15, C — 40, B — 30, 
A - 10. 

The standard, sigma, or z-score method of assigning marks is 
based upon an ideal distribution of marks, which may be normal 
or any other type desired. The limits, in terms of standard, a, 
or z-scores, of the divisions of the ideal distribution are found 
and then marks are assigned according to the limits between 
which scores fall, regardless of what fraction of marks actually 
falls within any division. Thus this method assumes an ideal 
distribution and assiunes that the mean of the actual distribu- 
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tion is the same as the mean of the ideal one, but assigns marks 
according to the actual distribution of scores. 

To illustrate this procedure, the data in Table IX on page 89 
may be used. Let us assume that we wish to change the 15 
scores listed there into marks and that the 7,24,38,24,7 
distribution is to be considered the ideal one. First we need 
to find the four points, in terms of <r distances from the mean, 
which divide the normal surface into such parts. These, as 
already given earlier in this section, are — 1.5<r, — .5<r, +.5cr, 
and +1.5<r. Using the mean, 41,8, and the standard deviation, 
31.55, found in Table IX, we compute the four division points 
as follows: 

M - 1.5<r = 41.8 - 1.5 X 31.55 = -5.5 
M - .5<r = 41.8 - .5 X 31.55 = 26.0 
ilf + .5(7 = 41.8 + .5 X 31.55 = 57.6 
M + 1.5(7 = 41.8 + 1.5 X 31.55 = 89.1. 

Therefore marks should be assigned as follows: 

E below —5.5 
D from —5.5 to 26.0 
C from 26.0 to 57.6 
B from 57.6 to 89.1 
A above 89.1. 

According to this, one score, 98, becomes A ; four, 81,81,72, and 
85, become B’s; two, 52 and 37, C’s; eight, 18,13,15,15,23,12,14, 
and 11, D’s; none is an E. 

If, instead of the 7,24,38,24,7 distribution, another — such as 
the 5,15,40,30,10 one, for example — ^is preferred, the only differ- 
ence is that the division points therefor are employed. From 
the Appendix they are found to be in order, — 1.65o-, — .84<r, 
+,25<t, and -f-1.28<r. In terms of scores these are: 

M - 1.65(7 = 41.8 - 1.65 X 31.55 = -10.3 
M - .84(7 = 41.8 - .84 X 31.55 = 15.3 
M + .25(7 = 41.8 + .25 X 31.55 = 49.7 
M + 1.28(7 = 41.8 + 1.28 X 31.55 = 82.2. 

Accordingly, marks should be given thus: 

E below —10.3 
D from — 10 3 to 15.3 
C from 15.3 to 49.7 
B from 49.7 to 82.2 
A above 82.2. 
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This gives two A’s, 98 and 85; four B’s, 81,52,81, and 72; 
three C’s, 18,23, and 37; six D’s, 13,15,15,12,14, and 11; and 
again no E’s. 

Changing qualitative data into a normal distribution 

It is frequently convenient to change nonquantitative data 
into numerical form. If they are qualitative — that is, such 
that they can be arranged in order of degrees of merit or excel- 
lence — they can be transmuted into a quantitative or numerical 
distribution. Naturally the form of this distribution should 
be such as fits the characteristic or fact involved. Since in 
many cases the appropriate form is the normal distribution, the 
procedure for transmuting qualitative into quantitative data 
according to that form will be explained. 

As data for the purpose just stated, we may take ratings of 
teachers, expressed verbally. Let us suppose, therefore, that 
80 teachers have been rated as follows: Excellent — 10; supe- 
rior — ^25; average — ^27; inferior — 14; very poor — 4; and that 
it is assumed they should be normally distributed. The gen- 
eral procedure is to express each of the five ratings in terms 
of the average distance of the portion of area under a normal 
curve that represents it from the mean or center of the curve. 
This distance may be in terms of any measure of variability, but 
the standard deviation is most commonly employed; hence, the 
distance used is a standard or z-score. 

Figure 22 illustrates the division of the normal curve into 



Fis. SS. Graphical Rcprctcntation of Data in Tabic XXXIV 
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portions corresponding to the five ratings as given in the last 
paragraph, and Table XXXIV contains the computation neces- 
sary to determine their standard deviation distance from the 

TABLE XXXIV 

Transmutation of Qualitative Data into Standard Deviation Distances 
FROM THE Mean op a Normal Distribution 

Fraction 
of Area 
Between 
Outer 
Ordinate 
and 

Maximum 
Ordinate 

.5000 
.3750 
.0625 
.2750 
.4500 
.5000 

mean. Thus, in the figure, the portion of the area at the right 
is ^ or .125 of all; the next is ^ or .3125 of all; and so on. 

oO oU 

In the table the first two columns contain the ratings and 
the number of each, and the third, the latter divided by the 
total munber, 80. The fourth-column entries are the fractions 
of the whole area under the curve between the ordinate at the 
outer limit of each of the five portions and the mean or maxi- 
mum ordinate. Since the outer ordinate of each extreme por- 
tion is at infinity from the mean ordinate, half or .5000 of the 
area is between it and the latter; since .1250 of the area is in 
the excellent division, .5000 — .1250 = .3750 of the area is 
between the outer ordinate of the next or superior division and 
the mean ordinate; and so on. The heights of the ordinates 
that include the areas just given are to be found in the fifth 
colunm. At infinity, corresponding to an area of .5000, the 
height is .0000; for each of the other fractions of area the height 
of the outer ordinate which includes it is found from the 
Appendix. Thus the ordinate that includes an area of .3750 


Difference 
Height in 

of Heights 

Ordinate of 

Ordinates 


Mean 

Standard 

Deviation 

Distance 

from 

Mean 


+.5160 

+.4717 

-.2359 

-.4933 

-.2585 


Rating 

Number 

of 

Cases 

Fraction 

of 

Area 

Excellent 

10 

.1250 

Superwr 

25 

.3125 

Average 

27 

.3375 

Inferior 

14 

.1750 

Very Poor 

4 

.0500 
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between it and the mean ordinate is .5160 high; that which 
includes an area of .0625 is .9877 high; and so on. The next 
column contains the differences in heights of the ordinates just 
foimd, each difference having the sign given by subtracting a 
height from the one just below it. Thus .5160 — .0000 = 
+ .5160; .9877 — .5160 = +.4717; and so on. Finally, the 
desired standard deviation distance of each group of ratings 
from the mean is found by dividing each difference by 2.5066 
times the corresponding fraction of area. For the excellent 


group, 2 50^^^1250 ~ +1-65, which is its average <r distance 

from the mean. Similarly, the average distance of the superior 
group is found to be +.60(r, that of the average group — .280", 
that of the inferior group —1.12a, and that of the very poor 
group — 2.06<r. These may then be used where numerical 
rather than verbal ratings are desired. 

It is possible, by a somewhat easier method, to secure median 
instead of mean distances, but for most purposes they are less 
satisfactory. They are regularly somewhat less, positively or 
negatively, than the mean distances. To secure them, all that 
is necessary is to find the fraction of area between the ordinate 
that divides each portion into two equal parts and the mean 
and find its distance from the mean. Thus, for the excellent 
group, .4375 of the area lies between the dividing point and the 
mean, and the corresponding a distance is +1.53. For the 
other ratings the median a distances are, respectively, +.58, 
-.27, -1.09, and -1.96. 

Probably the most frequent occasion for the application of 
this procedure is in cases wherein ratings or other data from 
several sources are to be combined or compared. For example, 
if one supervisor assigns ratings to a group of teachers according 
to the distribution in Table XXXIV, whereas another, rating 
the same 80 teachers, gives 5 of them excellent, 12 superior, 
33 average, 19 inferior, and 11 very poor, it is evident that the 
latter’s standards are more severe than those of the former 
Therefore an excellent rating as given by him represents a higher 
degree of merit than a similar rating by the first rater. Simi- 
larly, the significance of the other fom* depends upon the dis- 
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tribution of ratings. To combine or compare them, we should 
express the meaning of those given by each rater in similar 
fashion. 

To illustrate further the application of this procedure, we 
may suppose that each of the 80 teachers was rated by two 
supervisors, A and B, who gave the two sets of ratings reported 
above and also by two others, C and D, whose ratings along 
with those of A and B are given in Part A of Table XXXV. 
From the data there it is evident that C gave more extreme 
ratings, both high and low, and D fewer extreme ones, than A 

TABLE XXXV 

Ratings Assigned Eighty Teachers by Four Supervisors 
AND Standard Deviation Distances from the Mean 
Corresponding Thereto 


PART A. ASSIGNED RATINGS 


Rating 

Supervisor 

A 1 

B 

C 

1) 

Excellent 

10 

5 

10 

4 

Superior 

25 

12 

18 

14 

Average 

27 

33 

22 

42 

Inferior 

14 

19 

19 

17 

Very Poor 

4 

11 

11 

3 


PART B. STANDARD DEVIATION DISTANCES 


Rating 

1 Supervisor 

i 

A 

B 1 

C 

D 

Excellent 

+1.65 

+1.97 

+1.65 

+2.06 

Superior 

+.60 

+1.11 

+.73 

+1.12 

Average 

-.28 

+.22 

-.03 

+.03 

Inferior 

-1.12 

-.67 

-.67 

-1.11 

Very Poor 


-1.60 

-1.60 , 

-2.18 
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and B. In Part B of Table XXXV the mean o distance from 

the mean, or 2-score, corresponding to each rating by each 

supervisor is given. Those for A are as found in Table XXXIV 

and the others were computed by the same process. To secure 

the mean rating given a teacher by the four supervisors, the 

four (T distances corresponding to the four ratings should be 

averaged. Thus, if Miss Smith was rated superior, average, 

superior, and superior by the four supervisors, in the order used 

in the tables, her mean rating is +.67, the average of +.60, 

+ .22, +.73, and +1.12. Similarly, if Miss Jones is rated 

average, inferior, inferior, and average, in order, her mean rating 

. -.28 - .67 - .67 + .03 
IS = — .40. 

4 


Comparing an actual with an ideal distribution 


Workers with educational data sometimes wish to compare 
actual with ideal or hypothetical distributions. Since the most 
frequently used type of ideal distribution is the normal, the 
procedure will be illustrated for that type, but it may be applied 
to others also. Generally the simplest method of making such 
a comparison is to graph the actual and the theoretical dis- 
tributions on the same figure and scale. An example of so 
doing is shown in Figure 23. The solid line therein represents 
the actual distribution ; the broken line, the ideal or normal one. 
The latter was constructed by placing its center at the mean. 


where its height is given by the formula y© = 


N _ N 1 
2.5066(r’ 


and then, by means of standard deviation distances and the 
first table in the Appendix, locating points through which it was 

750 

drawn. Here «o, the maximum ordinate, = ^ v/ ^ = 

2.5066 X 1.864 

160.50. By laying off distances .25a or even .5<r apart in each 
direction from 64.80, the mean, and multiplying 160.50 by the 
entries in the height column corresponding to the a values 
employed, the points through which the best-fitting normal 


« ^ If the data are grouped in classes and the figure constructed on that basis, as 
here, a should be in terms of intervals; if they are not grouped, <r should be in 
terms of score units. 
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curve should pass were determined. Thus, for example, 
.5o- = .5 X 9.32 = 4.66. This distance below the mean gives 
64.80 — 4.66 = 60.14 and above the mean 64.80 + 4.66 = 
69.46. The height entry corresponding to .6<r is .8825; hence 
the height of the desired normal curve at 60.14 and 69.46 is 
.8825 X 160.50 = 141.64. Similarly, at 64.80 - 9.32 = 55.48 
and at 64.80 + 9.32 = 74.12 its height is .6065 X 160.50 = 
97.34, and so on. 

If the theoretical frequencies given in the sixth column of 
Table XXXVI have been determined, the best-fitting normal 



curve can be constructed more easily from them than by the 
method used above. The entries in this column are the heights 
of the normal curve at the mid-points of the classes; so all that 
is necessary is to locate points in accordance with that fact and 
draw a curve through them. The first such point is at a height 
of .41 above 32.5, the mid-point of the 30-class; the next is 2.21 
above 37.5; the next is 9.24 above 42.5; and so on. 

Inspection of Figure 23 reveals that no very precise idea of 
the amount of difference, or of the chances that it is significant, 
can be obtained therefrom. For such purposes computational 
rather than graphic methods are needed. 



Comparison of Actual and Normal Curves 
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64.80 a = 9.32 j/o = 100.50 X* = 8.83 
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There are several computational methods of making com- 
parisons between actual and theoretical curves, not all of which 
lead to the same conclusion in a given instance. The most 
commonly used one is that known as the x* {Chi-square) Test 
for Goodness of Fit. Its validity depends upon several assump- 
tions, and interpretations based upon it are only approximate. 
It should be based upon a sample of not less than 500 cases, 
with not less than 10, or perhaps 5, in any single class. When 
classes contain fewer than this number, two or more adjacent 
ones should be combined for the last steps in the computation. 

The method jvist mentioned is illustrated in Table XXXVI. 
The first two columns thereof contain the actual distribution; 
the third, the class mid-points; and the fourth, the distance of 
each mid-point from the mean, 64.80. No attention need be 
given the sign of this distance. In the fifth column is the cr 
distance of each mid-point from the mean, obtained by dividing 
the entry in the previous column by <r, 9.32. Next the theoret- 
ical frequency for each class is found. It is determined by 
multiplying the height, as given in Table XLIV, at the <r dis- 
tance by the maximum ordinate, here 160.50. Thus the tabled 
height of the normal curve at 3.51(r from the mean is .0021 and 
this times 160.50 equals .34.^ The following column contains 
the difference between each actual and corresponding theoret- 
ical frequency, taken without regard to sign. To avoid the use 
of too small frequencies, those in the first three classes have 
been combined, as also have those in the last three. Therefore 
the first difference is the sum of the first three actual frequencies 
minus that of the first three theoretical frequencies. This gives 
(1 -I- 3 + 9) - (.34 -I- 1.96 -1- 8.22) = 13. - 10.52 = 2.48 as 
the first difference. The second is 4.40, that between 22. and 
26.40, and so on for the others. 

X* may be found by the formula 



> Another method of finding theoretical frequencies may be employed. It is 
to find the a distance of each class limit, rather than of the mid-point, then the 
normal areas beyond each of the two limits of each class, and fin^y to take the 
difference of these areas times N, 
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in which / is the theoretical frequency. Thus the first entry 
in the next column is .69, obtained by dividing 2.48* by 10.52; 
the next Is .73, which is 4.40* -f- 26.40; and similarly for the 
others. Their sum — that is, x* — is 8.83. 

To interpret this, or any value of x®, a table such as XXXVII 
is needed. It gives values of P, probability, corresponding to 
those of X* and numbers of classes.® In this case there are 
10 classes, since the number remaining after combinations of 
small classes should be used. Interpolating in the table gives 
P = .46 for x^ = 8.83 and 10 classes. This means that there 
are 46 chances out of 100, or 46 to 54, or about 1. to 1.17, that 
random sampling would yield a fit no better than that in this 
case. 

Although interpretation in terms of adjectives is not very 
satisfactory. Culler’s suggestions may be helpful. They are 
that if P is .75 or more, the fit of the actual to the theoretical 
distribution is superlative;, if P is .50 to .74, it is excellent; if 
.25 to .49, good; if .10 to .24, fair; if .05 to .09, poor; if .00 to 
.04, unacceptable. Another suggestion is that if P is less 
than .20 the distribution probably resembles some other type 
of curve more closely than it does the normal. 

The last three columns of Table XXXVI illustrate further, 
but rarely worth-while, analysis of the situation. It is the 
determination of the chances that the difference between the 
actual and the theoretical frequency of each class is significant. 
The first of these three columns contains the standard error 
of this difference. It is found by the formula 

, _ [fw^) rf_~p, 

- yj yjf 

• Degrees of freedom instead of classes may be used. Degrees of freedom may 
be defined as the number of independent observations or yariates minus the 
number of true measures or parameters computed or estimated from them. In 
the case of grouped distributions such as the one above, for which the three 
measures N, M, and <r are found, the number of degrees of freedom is regularly 
three less than the number of classes. In this special case, however, it is best 
taken as only one less. Justification of this may be found in Peters and Van 
Voorhis, also elsewhere. See Peters, Charles C., and Van Voorhis, Walter R., 
• Statistical Procedures and Their Malhemalical Bases. Pages 412-414. New York: 
McGraw-Hill Book Company, Inc., 1940. 
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♦ This table might be given according to numbers of degrees of freedom rather than of classes. In this case the number of degrees 
of freedom should be taken as one less than the number of classes. 
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in which / is, as above, the theoretical frequency. Thus, for the 
first difference, 2.48, 


*diff 


V 


10 . 52 ( 750 . - 10 .. 52 ) 
750 . 


= 3 . 22 . 


In the next column are the quotients of the differences divided 
by their respective standard errors, such as 2.48 -r- 3.22 = .77. 
By considering this as an entry in the first column of Table 
XLIV and finding the corresponding entry in the last coliunn 
of the same table, we obtain the chances to 1 that the difference 
between actual and theoretical frequencies of a single class is 
significant. For .77 they are 3.5; for the next entry, .87, they 
are 4.2; and so on. For only one class, that from 65 up to 70, 
are the chances great enough to approach what is usually 
considered reasonable certainty. 

The procedure just described is not limited in its application 
to comparisons with a normal distribution, but may be em- 
ployed for those with others also. Likewise it may be used 
with data in such tables as those from which tetrachoric, phi, 
and contingency coefficients are computed. In such cases 
what is known as an independence table is set up and the ob- 
tained data compared with it. An independence table is one 
that contains a purely random distribution within its cells of 
the cases composing the class totals; in other words, it is a 
table showing how the given numbers of cases would be dis- 
tributed if there were zero correlation between the variables or 
attributes concerned. 

To illustrate the application of the test to such a situation, 
the data used for the computation of the coefficient of con- 
tingency will be employed. The entries in a number of the 
cells are less than 5, which is not desirable, but the procedure 
will be applied so as to show the method. The effect of group- 
ing small frequencies together tends to be to increase P, which 
in this instance is found to be so high without doing that as to 
leave slight possibility of increase. The first part of Table 
XXXVIII repeats the data in the same part of Table XXXII 
on page 190. Part B is the corresponding independence table. 
Each entry therein is the product of the frequencies of the row 
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TABLE XXXVIII 

Application op x* Test to Contingency Table by Means of 
Independence Table 


PART A. actual DATA 


Major Subject 


Mark 

English 

For. 

Lang. 

Ind. 

Arts 

Math. 

Science 

Soc. 

St. 

Total 

A j 

4 

3 

6 

2 

CO 

3 

21 

B 

IS 

4 

13 

6 

8 

6 

45 

C 


4 


7 

8 

5 

41 

D 

5 

1 

4 

5 

3 

3 

21 

E 

1 

0 

1 

1 

2 

1 

6 

Total 

25 

12 

34 

21 

24 

18 

134 


PART B. independence TABLE 


Major Subject 


Mark 

English 

For. 

Lang. 

Ind. 

Arts 

Math. 

Science 

Soc. 

St. 

Total 

A 

3.92 

1.88 

HIM 

3.29 

3.76 

2.82 

21 

B 

8.40 

4.03 

Wife HI 



6.04 

45 

C 

7.65 

3.67 

10.40 

6.43 

7.34 

5.51 

41 

D 

3.92 

1.88 

5.33 

3.29 

3.76 

2.82 

21 

E 

1.12 

.54 

1.52 

.94 

1.07 

.81 

6 

Total 

25 

12 

34 

21 

i 

24 

18 

134 


PART C. computation OP X* 


Major Subject 


Mark 

English 

For. 

Lang. 

Ind. 

Arts 

Math. 

Science 

Soc. 

St. 

Total 

A 

.00 

.67 

.08 

.51 

.15 

.01 

1.42 

B 

.02 

.00 

.22 

.16 

.00 

.00 

.40 

C 

.06 

.03 

.02 

.05 

.06 

.05 


D 

.30 

.41 

.33 

.89 

.15 

.01 


E 


.54 

.18 

.00 

.81 

.04 


Total 

.39 1 

1.65 

.83 

1.61 

1.17 

.11 



x» « 5.76 P » 1.00- 














































































USES OF THE NORMAL PROBABIUTY CURVE 


215 


and column wherein it falls divided by the total number of 
cases. Thus for a mark of A in English the independence value 

Ol V 9*^ 

is — — = 3.92. This is the number of A’s in English that 

would result if there were no association at all between subjects 
and marks. Similarly all the other independence values are 
found. In Part C are the squares of the differences divided by 
the independence values. For example, for A’s in English the 
difference is 4. — 3.92 = .08 and .08* - 5 - 3.92 = .00 to two 
places. For A ’sin foreign language, 3. — 1.88 = 1.12, and this 
squared divided by 1.88 gives .67, and so on for the other cells. 
The sum of these entries is 5.76, which is x*. 

If the coefficient of contingency has been found, x® can be 
computed very easily. It is merely N(T — 1). For these 
data N is 134 and T was found to be 1.0430; hence 

X* = 134.(1.0430 - 1.) = 5.76, 

as before. If T is not available, but only the coefficient of 

NC^ 

contingency itself, x* can be found by the formula ^ 

In this instance this gives 


134. X .20* 
1 . - . 20 * 


5.58, 


which differs from 5.76 because C was computed and used to 
only two places. Conversely, if x* is known and C is desired, 
the formula 


4 , 


Ix^ + N 

may be employed. Thus here. 




5.76 


76 + 134. 


= . 20 . 


Although Table XXXVII was constructed for use with 
grouped distributions rather than two-way tables, it may be 
employed with the latter by finding the number of degrees of 
freedom, adding 1, and considering this as the number of classes. 
The number of degrees of freedom in a two-way table is regu- 
larly the product of ni — 1 and nj — 1, ni being the number of 
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classes in one direction and 712 that in the other. Here this 
gives (6 — 1) (5 — 1) = 20 degrees of freedom, corresponding 
to 21 classes. Table XXXVII ends with 20 classes, but it is 
evident that for x* = 6-76 and 21 classes P is nearer 1.00 than 
.99. In other words, there are practically 100 chances in 100 
that the differences of actual from theoretical data are mere 
random variations and do not signify any real tendency toward 
relationship between marks and school subjects. 

As another example, the data employed for tetrachoric cor- 
relation on page 192 may also be subjected to the x* test. 
They, the independence table for them, and the determination 
of X* appear in Table XXXIX. Since the steps are just the 


TABLE XXXIX 

Application op x* Test to Tetrachoric Table 


PART A. 


ACTUAL DATA 


Ele- 
ment 
No. 2 

Element No. 1 

i 

R 

W 

T 

w 

67. 

63. 

130. 

R 

82. 

38. 

120. 

T 

149. 

101. 

250. 


BY MEANS OF INDEPENDENCE TABLE 


independence TABLE 


Ele- 
ment 
No. 2 

Element No. 1 

R j 

w 1 

T 

W 

77.5 

52,5 

130 

R 

71.5 

48.5 

120. 

T 

149.0 

101.0 

250. 


COMPUTATION OF x* 


Ele- 1 
ment 
No. 2 ' 

Element No. 1 

R 

If 

T 

W 

1.42 

2.10 

3.52 

R 

1.54 

2.27 

3.81 

T 

2.96 

4.37 

7.33 


X* = 7.33 P = .01 


PART B. BY MEANS OF FRACTIONAL ENTRIES 


Ele- 
ment 
No. 2 

Element No. 1 


R 

Tf 

T 


W 

.268 

.252 

.520 

, _ 250.(.268 X .152 - .252 X .328)* _ _ 

^ .596 X .404 X .520 X .480 

R 

.328 

.152 

.480 

T 

.596 



1-4 

q 

H 


same as in Table XXXVIII, they will not be explained again. 
For this, as all tetrachoric tables, there is only one degree of 
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freedom, which corresponds to 2 classes in Table XXXVII. 
Therefore for x* = 7.33, P is about .01; so there is about 1 
chance in 100 that the differences are due to variations of 
sampling rather than to real relationship. 

For such a table x* also be computed by employing fre- 
quencies expressed as fractions of the total niunber of cases. 
Using the same symbols as for the <l> coefficient, the formula for 
this method is 

j _ N(aS — 

For the data in Table XXXIX this gives 

250.(.268 X .152 - .252 X .328)* 

.596 X .404 X .520 X .480 

which is slightly different from 7.33 because of dropping 
decimals. 

A simpler but less exact measure of goodness of fit is the per 
cent of actual and theoretical frequencies that agree. One way 
of finding this is to compare the two for each class or cell, as 
the case may be, find the sum of the smaller of each such pair 
for the whole table, and compute its per cent of N. Another 
is to subtract half the sum of all the differences, without regard 
to sign, from N and find what per cent the remainder is of N. 
For the data in Table XXXVI the first gives 

10.52 + 22. + 54. -t- 113.70 4- 153.82 + 155.38 + 

105. + 63. + 28.55 + 11.86 = 717.83, 

which divided by 750 yields 96— per cent. The second gives 
750. - = 718.07 

and this divided by 750 likewise yields 96— per cent. These 
methods may also be employed for the data in Tables XXXVIII 
and XXXIX. 

As was stated on p. 182, x* “f^ay be used to estimate the 
reliability of differences between ratios and coeflBcients of cor- 
relation. For this purpose 
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For the data in Table XXIX, for which 

= .902, ijy, = .914, r = .892, N — 50, and n = 12, 

X* for = (50 - = 3.60 

and 

for Tj„ = (50 - 12)- ——---- = 9.17. 

Employing these values in Table XXXVII, with the number 
of classes taken as n — 1 or 11, yields P = .97 and .51, respec- 
tively. There are, therefore, about 97 chances out of 100 that 
the excess of curvilinear over rectilinear relationship is due to 
chance in the first case, and 51 out of 100 that it is in the 
second. 

Determining dlfRculties of test elements 

For purposes of weighting and other reasons those who con- 
struct and use tests may desire to determine the relative diffi- 
culties of test elements. The generally accepted method, one 
that has been employed for many standard tests, is based on 
the assumption that, if the area under the normal curve is 
divided into two portions — one corresponding to the fraction 
of individuals passing an element and the other to that of those 
failing it — the point of division is an index of difficulty. 
This point is expressed in terms of some measure of variability, 
usually either the standard or the median deviation. This 
assumption has not been proved, but in practice appears to 
give reasonably valid results. 

The details of applying the procedure just suggested vary 
somewhat. Either fraction passing or fraction failing, or 
sometimes one and sometimes the other, may be employed. 
Perhaps the easiest way is to use whichever is the smaller, 
locate it in the fourth column of the table of the normal curve 
in the Appendix, find the corresponding entry in the first 
column, and prefix a plus sign to it if the fraction passing was 
used, a minus sign if the fraction failing was taken. This 
should be done because, if less than half pass an element, it is 
more difficult than average; whereas, if less than half fail it. 
it is less difficult. 
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To illustrate this, let us suppose that 85 per cent of those 
taking a test pass and 15 per cent fail a certain element. The 
<r distance corresponding to 15 per cent of area beyond an 
ordinate at that distance is about 1.04; hence the element is 
1.04<r less difficult than average, so receives a difficulty rating 
of — 1.04(r. For one passed by 30 per cent and failed by 
70 per cent we find that about .52<r corresponds to an area 
of .30; hence rate it +.52<r. If an element is passed or failed 
by everyone, it is rated at — « or + » . 

In order to eliminate minus signs, some point far enough 
below average so that no element is likely to be easier than one 
rated there is often chosen and difficulty values expressed as 
distances above that. Thus, for the well-known and much 
used T-scale and T-scores,^ based on a normal distribution of 
twelve-year-olds, a point 5<r below the mean was chosen. 
Accordingly a difficulty value of — 1.04<r becomes 5. — 1.04 = 
3.96; one of -l-.52<r becomes 5. 4- .52 = 5.52; and so on. Also 
a multiplier, 10, was used and fractions dropped, thus giving a 
scale ranging from 0, with 50 as average, up to 100. On this 
scale a difficulty value of 3.96 as given above becomes 40; 
one of 5.52 becomes 55; and so on. 

Equally noticed differences 

The theorem that equally noticed differences are equal, with 
certain concomitant additions, is sometimes applied to deter- 
mine the quality or merit of specimens, such as those used in 
handwriting, drawing, composition, and other quality scales. 
The validity of its application depends upon several assump- 
tions, the chief of which are that the distributions of ratings 
of the various specimens are normal in shape and that areas 
under the normal curve and base-line distances validly repre- 
sent differences in raters’ judgments and in quality, respec- 
tively. In general, the results from this method correlate so 
highly with those obtained by using average ratings on a 
quantitative scale that which is employed is a matter of 
convenience rather than of validity of results. 

, * McCall suggested this procedure and named it in honor of Thorndike and 
Terman. 
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To illustrate this method, let us assume that we have had a 
group of raters or judges compare specimen A with B, B with 
C, and C with D, and that 61 per cent of them have rated B 
as better than A, 75 per cent have rated C as better than B, 
and 94 per cent have rated D as better than C.® The follow- 
ing procedure is merely to subtract 50 per cent from each per 
cent of better ratings and find the deviation distance corre- 
sponding to this area from the mean. Generally the median 
deviation is used, although any other can be used. For A and 
B, we have 61 per cent — 50 per cent = 11 per cent, and from 
Table XLV in the Appendix we find that the median deviation 
distance that includes this fraction of the area between an 
ordinate at its distance from the mean and the mean is ap- 
proximately .41; therefore specimen B is AlMdD better 
than specimen A. By similar procedure, C is l.OOMdD 
better than B, and D is 2.30MdD better than C. Likewise, C 
is AlMdD -|- l.OOMdD = 1 AlMdD better than A; D is 
.41JlfdD -f- l.OOMdD -H 2.30MdD = 3.71MdD better than A; 
and D is l.OOMdD + 2.30MdD = 3.30MdD better than B. 

If it is desired, each possible pair may be compared and the 
results obtained from the various direct and indirect compari- 
sons may be averaged. Thus, if 99 per cent of the raters placed 
D above A, the corresponding difference in merit would be 
3.45MdD rather than the 3.71MdD found indirectly through 
B and C. An average of the two, 3.58, is probably better than 
either. If many specimens are involved, the niunber of possi- 
ble indirect comparisons becomes so great as to render the task 
of making and averaging them so laborious that it is rarely 
carried out in full. 

If the per cent of judges rating one sample as better than 
other is 100, the difference cannot be determined more specifi- 
cally than as approaching infinity. If it is 50, the other 50 per 
cent evidently consider it worse; hence the two specimens are 
of equal merit. If it is less than 50, one may either reverse the 
direction of comparison or take the difference between it and 


^ In this method raters must give no tied or equal mtings, but must always rate 
one of the pair of specimens being compared above the other. 
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50 and interpret the corresponding MdD distance as the amount 
the sample in question is below the other. 

In connection with this procedure we often wish to know 
how many judges are necessary to result in a given desired 
degree of reliability. This depends on both their number and 
the per cent that rate one sample above the other. If the 
reliability of ratings from a given number of judges is known, 
that for some other number is given approximately by the 
Spearman-Brown formula, fovmd on page 151. More exactly, 
however, the number of judges necessary for a given degree of 
reliability — ^that is, for a given probability that a specimen 
voted better than another would likewise be rated better by 
another similar group of judges — ^is the next larger integer than 
the value of the formula 

x^pq 

(p - .6)*’ 

in which x is the number of standard errors corresponding to 
the degree of reliability, p is the fraction of judges rating one 
specimen better than the other, and g is 1 — p, or the fraction 
of judges rating it worse. Table XL gives the numbers of 
judges, as determined by this formula, necessary to insure the 


TABLE XL 

Numbers of Judges Necessary to Insure Various Degrees of 
Reliability of Ratings of Specimens 


Standard 

Errors 

Chances to 1 
That Specimen 
Rated Better 

Is Better 

Per Cent of Judges Rating One Specimen 
Better than Otlier 

55 

60 

65 

70 

75 

80 

85 

90 

95 

3.0 

739 

892 

217 

92 

48 

28 

17 

10 

6 

3 

2.5 

161 

619 

151 

64 

33 

19 

12 

7 

4 

2 

2.0 

43 

397 

97 

41 

22 

13 

8 

5 

3 

1 

1.5 

14 

223 

55 

23 

12 

7 

5 

3 

2 

1 

1.0 

5.3 


25 

11 

6 

4 

2 

2 

1 

1 

.5 I 

2.2 

25 


3 

2 

1 

1 j 


1 

1 
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chances, corresponding to certain values of the standard error 
and per cents of judges rating one specimen above another, that 
the specimen so rated really is the better of the two. It makes 
apparent that the number of judges needed decreases quite 
rapidly as the per cent rating one specimen above another 
increases. 

Equivalent or comparable measures 

Although the topic of equivalent or comparable measures 
has been touched previously, it seems well to present a short 
summary treatment here. Very frequently educational work- 
ers find it desirable to compare or combine scores derived from 
various tests and other sources with different scoring systems 
and units. To do so more than very superficially necessitates 
that they be transmuted to a common system, that is, be ex- 
pressed on equivalent scales with equivalent units. 

If the distributions of scores involved are nearly normal, or 
only moderately skew, the standard or z-score is generally the 
most valid comparable score. As already given, it is 

Score — M 

2 = j 

<7 

that is, it expresses scores in terms of their <r distances from the 
mean. It assumes that the mean score of one distribution may 
be considered to correspond to that of the other, and similarly 
for their standard deviations. Similar scores based upon some 
other appropriate combination of a measure of central tendency 
and one of variability could be employed, but rarely are. 

Another means of rendering scores comparable is the use of 
the equivalent score equations, which are simply the regression 
equations with r = 1.00. By their use scores in other dis- 
tributions may be changed so as to be expressed in the same 
units and on the same scale as those in one taken as standard; 
or an ideal distribution with convenient values, such as Af = 50 
and a = 10, may be set up and used. Results from the applica- 
tion of this and the «-score method to the same original scores 
correlate perfectly. In general it is merely a matter of con- 
venience which is used. Since the 2-score method is much 
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more commoiUy employed and is probably easier, it appears 
preferable. _ 

If distributions approximate rectangular form, percentile 
ranks, or just ordinary ranks if the munbers of cases involved 
are the same, are the best. They are often employed regardless 
of the shape of distributions, but so doing is liable to lead to 
erroneous conclusions. If a distribution approximates nor- 
mality, [/-shape, or any of various other types, the differences 
between percentile ranks or points apparently equidistant are 
not equal. For example, in such cases the difference between 
P 90 and Pgo is not the same as that between Pgo and P 70 or as 
that between Pgo and Pgo. In a rectangular distribution such 
differences are equal. 

A th rd type of comparable measure that is morg often em- 
ployed elsewhere than in educational work is the ratio score. 
It is valid only if three conditions are fulfilled: the zero 
points of the series of scores being compared must represent the 
same amounts of whatever they measure; some other point, 
usually the mean, of one series must correspond to the same 
point of the other; and the laws of development, that is, the 
shapes of the distributions of scores, of the things measured 
must be the same. The third condition is often not true, or at 
least not known to be true, and the other two are far from 
universally so; hence the validity of ratio scores is limited. 

X 

The formula for a ratio score is simply •=, in which X repre- 

ti 

sents the score and B the base. Often, as suggested above, this 
X 

becomes • To illustrate the use of ratio scores, we may assume 
M 

that a distribution of pupils’ scores upon an achievement test 
and that of their mental ages meet the three conditions stated 
above, and that the means are 65 and 11—8, respectively. If 
pupil A makes 56 on the test and has a MA of 11 —4, his ratios 

are It = .86 and I = .97.® A comparison of these ratios 
65 11—0 

® The easiest way to handle this ratio is to reduce the MA*s to months. Thus 

1 OA 

11-4 = 136 months and 11-8 = 140 months, and — = .97. 
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indicates that he stands distinctly lower on the test than in 
MA. If pupil B has a score of 83 and a MA of 15—1, his 

ratios are = 1.28 and ^ = 1.29, which indicate that 

DO 11 — 0 

his test score and MA are relatively very nearly the same. 


EXERCISES AND PROBLEMS 

1. Find the number of cases falling between each of the following 
pairs of scores, assuming that each distribution is normal and has a 
mean, standard, or median deviation, and number of cases as given: 

(а) Between 40 and 50, if Af = 34.5, a = 7.8, and N = 90. 

(б) Between 60 and 80, if M = 63.6, <r = 12.2, and N = 125. 

(c) Between 5 and 32, if Af = 49., MdD = 8.4, and N = 420. 

2. Determine the per cents of pupils who should receive each 
mark if they are normally distributed according to the following 
specifications: 

(а) Six marks; curve cut off at =fc2.5<r. 

(б) Five marks; curve cut off at ±4.0 MdD. 

(c) Seven marks; curve cut off at ±3.0<r. 

3 . Assign marks according to the 2 f-score method to the scores 
given in each part of Exercise 6 on page 70. Assume the following 
bases for the several parts: 

(a) Five marks; curve cut off at ±2.5o-. 

(b) Six marks; curve cut off at ±3.0<r. 

(c) Four marks; curve cut off at ±3.0or. 

4. Change each of the following sets of data into a normal distribu- 
tion, with a values for each class: 

(a) High, 5; good, 17; fair, 23; poor, 8. 

(b) Superior, 16; good, 38; average, 64; poor, 27; failing, 6. 

(c) A, 14; B, 18; C, 24; D, 16; E, 9; F, 2. 

6. Find the probability that each of the distributions in Exercise 1 
on page 69 differs from normal. 

6. Find the probability that each of the sets of data in Exercises 
4 on page 196 and 6 on page 197 differ from an independence table. 

7 . Determine the difficulty of test elements answered incor- 
rectly by each of the following per cents of pupils: (a) 3; (6) 17; (c) 
81; (d) 46; (e) 99; (/) 65. 
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8 . If three supervisors assign ratings to a group of 50 teachers 
as shown below, find the average a rating of each of the two teachers 
whose ratings are given. 


Supervisor 

Ratings 

A 

B 

C 

D 

E 


Brown 

3 

12 

18 


7 

Jones 

4 

7 

28 

8 

3 

Smith 

10 

15 

15 

8 

2 


Teacher No. 1: B hy Brown, C by Jones, A by Smith. 
Teacher No. 2: D by Brown, C by Jones, C by Smith. 


9 . Find the difference in quality of pairs of specimens, one of 
which is rated better than the other by each of the following per cents 
of judges: (a) 68; (h) 82; (c) 36; (d) 97. 


10 . The following are the scores made by a class of 40 pupils on a 
series of tests, one in each of several subjects: 


Reading 

Arithmetic 

Language 

Writing 

Spelling 

50-2 

24-1 

70- 1 

60- 2 

45- 3 

45-3 

22-0 

60- 4 

50- 6 

40-14 

40-4 

20-3 

50-12 

40-21 

35-11 

35-6 

18-4 

40-16 

30- 8 

30- 6 

30-8 

16-3 

30- 4 

20- 3 

25- 5 

25-7 

14-9 

20- 2 


20- 1 

20-6 

12-6 

10- 1 



15-2 

10-5 




10-2 

8-5 





6-2 

4-0 

2-0 

0-2 


Find the average standing, by both ^-scores and percentile scores, 
of pupils whose scores on the five tests are as follows: 


PupU 

Reading 

Arithmetic 

Language 

Writing 

Spelling 

A 

36 

21 

51 

44 

40 

B 

23 

9 

26 

33 

29 

C 

52 

19 

61 

53 

48 

h 

13 

7 

25 

22 

30 

E 

33 

17 

44 

42 

36 
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11. Find the chances that each one of the pair rated better in 
Exercise 9 really is better if each of the following numbers of judges 
participated in rating: (a) 10; (b) 76; (c) 24; (d) 3. 

12. Find the 2 ;-score, the percentile score, and the ratio score, using 
the mean as the base, of each of the following pairs of scores. The 
first one of each pair is a score in the column headed ^Tast'^ and the 
second in that headed ^‘Aver.^’ in Part (a) of Exercise 3 on page 80. 
From the comparable scores found, determine which of each pair is 
relatively higher with regard to its own group. 

(a) 132, 115; (6) 124, 106; (c) 112, 92. 

13. Do the same with these scores from Part (6) of Exercise 3 on 
page 80. 

(a) 95, 100; (b) 79, 86; (c) 103, 109. 
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CHAPTER XIV 


Sampling 


Nature and use of sampling 

All operations with data of any sort may be classed as belong- 
ing to one of two general types: either the data involved in- 
clude all of the sort in which the workers are interested; or they 
are a sample of a larger body of data which it is impossible or 
impracticable to include in its entirety. Especially in research, 
as contrasted with routine dealing with facts, is the latter often 
true. In many cases the same data may be employed in either 
way, or in both. Thus, if the heights of a class of forty children 
are the data used, they may be utilized either to determine 
desired facts about just that group of children or to furnish 
evidence as to probable facts for a larger group of which the 
group of forty is supposed to be representative. When used in 
the former way, they are frequently referred to as descriptive 
statistics; when in the latter, the drawing of conclusions about 
the larger group is often called statistical inference. In the 
latter case, and all others in which data from a sample are con- 
sidered indicative of facts concerning the total group it repre- 
sents, there are likely to be errors due to the fact that the 
sample is not perfectly representative of the whole group, or 
population or universe^ as it is often called. 

The terms statistic and parameter are so frequently used in 
this connection that readers should know them. Each term 
has other meanings, but as employed here a statistic is a meas- 
ure, such as a mean, a median, a standard deviation, a coefficient 
of correlation, and so on, computed from a sample; whereas 
a parameter is a similar measure of a whole population. In 
other words, a statistic is an actually secured measure subject 

m 
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to error when considered as a measure of a universe; whereas a 
parameter is a true measure thereof.* 

The whole theory of sampling errors has to do with variable 
rather than systematic errors. In other words, it is based upon 
the supposition that samples are chosen by some random or 
purely chance method. A random method is one according to 
which every case in the population sampled has an equal chance 
of being chosen. It eliminates all intentional bias and generally 
most unintentional bias also. The theory of sampling at- 
tempts to describe the extent to which such a random sample 
is, or is not, truly representative of the population from which 
it was chosen. This chapter deals with the estimation of the 
size and frequency of errors of sampling or, in other words, with 
the reliability of measures secured from samples when taken as 
indicative of similar measures of universes. Untrained workers 
often commit the mistake of computing errors of sampling 
when data from the total population have been used. In such 
instances, no sampling has occurred and consequently no such 
errors can be present. 

Selecting samples 

In selecting samples, especially small ones, workers should 
observe certain precautions which tend to prevent bias rather 
than rely entirely upon chance. These precautions are of two 
chief kinds : those having to do with the way in which tentative 
samples are chosen, and those concerned with checking their 
representativeness before employing them. The basis of select- 
ing samples should be one that has no correlation with the 
characteristics to be measured. In judging this point workers 
should be critical and should look beneath the surface. 

If the population to be sampled contains a number of groups 
or subdivisions of which the sizes are known, it is better to draw 
a random sample proportional to the size of the group from 
each, and then combine these, than to select a single random 
sample from the whole population. This method is sometimes 

^ To indicate that a measure applies to a universe rather than to a sample, a 
curl (^) may be written above it. For example, a stands for the standard 
deviation of a universe, not of a sample. Other methods of distinguishing be- 
tween them are sometimes employed, but this appears to be the most satisfactory. 
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called sampling hy subdivision or by stratification. For exam- 
ple, if the population is known to contain 54 per cent girls and 
46 per cent boys and a 10 per cent sample is to be taken, it is 
better to choose 10 per cent of each sex group by some random 
method than merely to select 10 per cent of the total number 
by the same method. If this procedure has not been followed 
at first, after a tentative sample has been picked from the whole 
population it should be checked to determine if it represents the 
known subdivisions equally well. 

To illustrate sampling further, let us suppose that a superin- 
tendent of schools of a populous county wishes to measure the 
general achievement of pupils completing the elementary school, 
but cannot well expend the money and effort necessary to test 
more than one-fifth of such pupils. A good method of selecting 
those to be tested would be to write the names of all graduating 
pupils upon similar slips and, after mixing them thoroughly, to 
select one-fifth by a purely chance procedure. A better one 
would be to arrange the names of all alphabetically and take 
every fifth one. Even better would be to apply the same method 
by schools or districts rather than to the county as a whole. 
To call for either pupils or teachers to volunteer, to take the 
one-fifth seated nearest the front, to test all in a few schools 
or systems containing the desired fraction, and to ask principals 
to designate pupils, are examples of unsatisfactory procedures 
likely to produce biased samples. Intermediate between good 
and bad would be to select at random one-fifth of the schools 
and to test all graduating pupils found in them. 

An excellent sampling procedure, but one little used in educa- 
tional work, is to employ a table of random numbers. A good 
discussion and illustration of this procedure, with such a table, 
is given by Lindquist.® A more complete table of this sort 
may be found among those of Fisher and Yates.* 

It is rarely possible to determine whether or not a sample is 
biased merely by examining the sample itself. If comparison 


* Lindquist, E. F., Statistical Analysis in Educational Research. Pages 25-29, 
262-264. Boston: Houghton Mifflin Company, 1940. 

* Fisher, R. A., and Yates, F., Statistical Tables for Biological, Agricultural and 
Medical Research. Pages 82-87. London: Oliver and Boyd, 1938. 
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with the population, as suggested above, is impossible or 
impracticable, the best procedme is to choose, in the same 
manner, but independently, several samples of equal size and 
then compare them. K they agree in essential characteristics, 
with only insignificant differences, any one, or preferably all 
combined, is probably satisfactory. They may be biased in 
the same direction, but this is unlikely. If they do not agree, 
either the size of the samples should be increased to a point at 
which they do, or else some other method of random selection 
should be employed. 

A point often neglected in the selection of samples and the 
determination of the reliability of measures derived from them 
is that the individuals composing a random sample should not 
be members of a relatively few integral groups, such as classes. 
If they are, the sample is likely to be much less representative 
of the universe than if they are not. For example, a sample 
of 20 classes of twenty-five pupils each is much less likely to be 
representative of 10,000 high-school freshmen in a large city 
system than is one of 600 chosen by taking every twentieth 
one alphabetically. This is true even if the 20 classes are a 
thoroughly random selection of all such classes. It is almost 
certain that the pupils in each of the 20 classes will have certain 
common characteristics that affect the sample. Such a sample 
will tend to be intermediate in representativeness between one 
of 600 pupils properly chosen and one of twenty pupils, also 
properly chosen. 

Sampling theories 

The classical or traditional method of estimating errors of 
sampling, often called the large sample theory, is based upon the 
assumption that such errors approximate normality in their 
distribution, with a mean at 0. Since most of the samples em- 
ployed in educational work are of such size, greater than 30 or 
40, as technically to be called large rather than small and since 
the distributions of errors for such samples for most common 
measmes usually do not vary greatly from normal, this theory 
yields more or less satisfactory results in many cases. On the 
other hand, we do use some small samples, and even for large 



SAMPLING 


231 


ones the distributions of errors for some measures are not nor- 
mal; hence better means of estimating sampling errors are 
often needed. The small sample theory has attempted to 
supply this need and has furnished formulas and interpretations 
based thereon that are, in some instances, distinctly superior 
to those previoiisly employed. In general, the small sample 
theory is applicable to large samples, but when so used ordi- 
narily yields conclusions very similar to those given by the 
large sample theory. 

Measures of error and their Interpretation 

Except in instances so rare as to be negligible, it is impossible 
to determine the sampling error in any single specific measure. 
The best that can be done is to determine the probable distribu- 
tion of errors in all similar measures and to estimate the chances 
that the error in any one of them is of a given magnitude. For 
example, if the mean score of forty pupils is taken as represent- 
ing the mean of a larger group from which the forty were 
selected, the error of sampling in that particular mean cannot 
be found. If it could, the mean would be corrected and the 
true mean found and used. Instead, the probable distribution 
of errors in means of all possible similarly chosen samples of 
forty pupils each can be estimated, also the chances that the 
error in the particular mean in question is more or less than a 
given size, or between certain limits. 

Since distributions of sampling errors can be estimated, the 
conventional method of describing them is the same as for 
any distribution, by stating a measure of central tendency and 
one of variability. Moreover, since positive and negative 
errors of sampling generally balance each other, the mean may 
be taken as zero. Therefore stating a measure of variability 
alone is usually sufficient to describe a distribution of errors. 
The measure of variability most often employed for this purpose 
is the standard deviation, but the median deviation and, rarely, 
the mean deviation are also used. In this connection they are 
called the standard error, symbolized by a or 6 (epsilon)^; the 

* a rather than c is the common symbol for standard error, but in order to avoid 
confusion with standard deviation the writer prefers, and will use, *. 
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probable error, {PE)\ and the mean error, i^ME). These 
measures of error are interpreted just as are the corresponding 
measures of variability, as explained in Chapter VI. In other 
words, if, as usual, the distribution of errors is assumed to be 
normal, the tables in the Appendix may be applied to them. 

In any such application and consequent interpretation, 
workers should remember that the chances found are those that 
a statistic or obtained measure is within the corresponding dis- 
tance of a parameter or true measure, not vice versa. In many 
instances a fundamental difference in interpretation is involved. 

Classical formulas for errors of sampling 

The classical or large sample theory formulas for the errors 
of sampling in a number of the measures presented in this vol- 
mne are given in Table XLI. For some measures no such 
formulas have been derived. The first column of the table 
lists the measures or statistics for which error formulas are 
given; the second contains the formula for the standard error 
of each; and the third, that for the probable error. Just as 
MdD = .6745<r, so PE = .6745e, but the latter’s value has 
been computed and is given in most cases. Many of the 
formulas are simplified approximations with such terms 
eliminated as largely increase their difficulty but slightly affect 
their values. In a few instances they cannot be simplified so 
as to be easily usable without too great sacrifice of accuracy; 
hence they are ^ven in forms rather laborious to employ. 

Although the error formulas in Table XLI are those con- 
ventionally given in connection with large samples, they are 
subject to a number of limitations even when so used. Some 
of these can be entirely or largely removed by appropriate 
modifications or corrections; others cannot. One of the most 
important is that the measures of variability, such as <r, Q, and 
so forth, which so often appear in the formulas should be 
parameters or measures of a universe rather than statistics or 
measures of a sample only. In most instances, however, only 
the latter are available; hence insofar as they are inaccurate 
estimates of the parameters, the results yielded by the given 
formulas are in error. In some instances corrected statistics 
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In these cases the standard error alone is given. It should, as always, be multiplied by .6745 to obtain the probable error. 
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or estimated parameters can be used instead of obtained 
statistics. 

The one correction that can be made most frequently applies 
whenever <r or MdD appears. If Sd* or S/d*, as the case may 
be, is divided by iV — 1 instead of by N, a better estimate of 
the standard deviation of the whole population is secured. If 
the ntimber of cases is very small, still further correction is 
needed. As a compromise between exact accuracy and ease 
of computation, it has been suggested that N be used when the 
number of cases exceeds 30, INT — 1 when it is between 20 and 30, 
N — 2 when it is between 10 and 20, and N — S when it is 
less than 10. The writer, however, recommends using N — 1 
for samples containing more than 30 cases as well as for those 
with fewer than that number. 

Another serious limitat’on is that the formulas, as given, 
include more than sampling errors. If the data are not per- 
fectly reliable — ^in other words, if a repetition of the procedure 
by which they were obtained would not yield corresponding 
data for the same individual cases — errors of measurement are 
also present. The formulas in Table. XLI include both types 
of errors. If the coefficient of reliability of a set of data is 
known, an error of sampling may be secured by multiplying a 
formula in the table by the square root of that coefficient, or 
The corresponding error of measurement may be found 
by multiplying the tabled formula by Vl — r,,. For example, 
the formula given for the standard error of the mean, is 

therefore the standard error of sampling of the mean is 


Vn 


and the standard eror of measurement of the mean is 


g-v^l — ni 


If the data are perfectly accurate, so that r,i = -f-1.00, then 
the error of measurement becomes 0 and that of sampling is 
given by the formula in Table XLI. 

Another correction that makes for greater exactness is to 
substitute N — n (n being the ntunber of variables or attri- 
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butes concerned) for N. Thus for a measure of central 
tendency or variability involving a single distribution, N — 1 
is used; for one of simple correlation or regression involving 
two, N — 2; and so on. Making this correction amounts to 
dividing by the number of degrees of freedom instead of by 
the number of cases. 

Most users of sampling error formulas overlook or neglect 
the fact that, as generally given and employed, they assume 
an infinite universe rather than take account of the number of 
cases in the universe. The number of cases in a universe is 
not always known, but when it is the formulas for errors of the 

cr 

mean and all others that contain or any multiple thereof 
should be multiplied by 

in which n is the number of oases in the sample and N that in 
the universe. A close approximation may be obtained by 
employing 




j • Tt 

which is sometimes written v 1 — p,p being equal to ^ or, in 


other words, being the per cent that the sample is of the uni- 
verse. Inspection of the formula shows that the larger the 
per cent of the universe included in a sample, the smaller the 
error of sampling. For example, if a sample contains 1 per 
cent of the cases in a universe, the multiplier is approximately 
.995; if it contains 2 per cent, it is .99; if 5 per cent, .975; 
if 10 per cent, .95; if 25 per cent, .87; if 50 per cent, .71; if 
75 per cent, .50; and so on until, if it contains 100 per cent, 
the multiplier is .00, since then there can be no error of sampling. 

The formulas for errors in averages overestimate the sizes 
of such errors if distributions of data are leptokurtic and 
underestimate them if they are platykurtic. The same is 
true of all error formulas given in Table XLI with regard to 
the shapes of the distributions of the errors themselves rather 
than of the data to which they apply. 
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Computing and interpreting errors of sampling 

To illustrate the computation and interpretation of errors 
of sampling, some of the data from the often-used set of 44 
cases may be considered as a sample of a larger niunber and 
may be employed. For them, M = 253.64 and <r = 52.96. 

Therefore, according to its conventional formula. 


^ = 7.98. 
V44 


VN 


If, as suggested above. A'' — 1 is used instead of iV,® 


52.96 

- Vis' 


If the coefficient of reliability of the data were known, the value 
just given should be multiplied by V^i. Since it is not known, 
it will be assumed to be -1-.90. Employing this value, we have 
8.08\/.90 = 7.67. If we further assume that the 44 cases 
constitute 5 per ce nt of the total group, we should multiply 
7.67 by Vl. — .05, which gives a result of 7.47, a better value 
than any of those preceding it. 

As previously suggested, this or any other such error may be 
interpreted, as may any similar deviation, by the use of the 
appropriate one of the tables in the Appendix. That is, the 
standard error of any statistic of a random sample is merely 
the standard deviation of a distribution of that statistic for an 
infinite niunber of random samples of the same size. Since in 
this case Cu = 7.47, there are approximately 68.27 to 31.73, or 
2.15 to 1, chances that the mean actually obtained is within 
7.47 of the true mean of the whole population from which the 
sample was drawn; 95.45 to 4.55, or 21 to 1, chances that the 
obtained mean is within 2 X 7.47, or 14.94, of the true mean; 
and so on for any multiple or fraction of Cu- 

* iV — 1 should be substituted for N in the calculation of <r rather than in that 
of e. Since, however, the result is the same whether a is computed with the use 
of N and then € with JV — 1, or o- with N — I and e with N, it is customary to 
introduce AT — 1 with e. If it is used in computing a of the data employed, 
<r = 53.58 rather than 52.96 and 

53.58 

tjii = “7=- « 8.08, 

V44 


as above. 
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Sometimes a somewhat-different approach is convenient. 
What is desired may be what per cent of obtained measures — 
in this instance, means — is within a specified number of points 
of the true mean. This may also be stated as the chances that 
any one of the obtained measures is within the same number of 
points of the true mean. To do this, the number of points is 
divided by the standard error and the result, which is merely 
the distance expressed in standard error or deviation units, is 
looked up in the table. If the distance is 10 points, for exam- 
ple, 10 is divided by 7.47, the result being 1.34. According to 
the table, 2 X .4096, or about 82 per cent, of the area of a 
normal curve is within 1.34e of the mean; hence 82 per cent of 
the obtained means will probably be within 10 points of the true 
mean. Stating it otherwise, the chances are 82 to 18, or about 
4.6 to 1, that any one obtained mean is within that distance. 

Still different is the situation in which we wish to know the 
chances that the true mean is as small or as large as a stated 
value. We may, for example, want to know the chances that, 
for the data just employed, the true mean is not 270 or larger. 
The procedure is to divide the difference between the true and 
the obtained means by its standard or probable error and find 
the last-column entry corresponding to the result. In this case, 
270 — 253.64 = 16.36, and this divided by 7.47, gives 2.19. 
The last-column entry corresponding to this in the first column 
of the standard deviation table is approximately 69; hence 
there are 69 chances to 1 that the true mean would not be 270 
or larger. 

Similarly, by the use of either the standard or the probable 
error, the reliability of other measures when they are regarded 
as statistics or sampling measures may be determined and 
expressed. The practice of stating the standard or probable 
error with the measure to which it refers should be much more 
frequent than it is. When the probable error is so stated, it 
is conventionally written after the measure with a plus-or- 
minus sign between them. For example, PEu for the data 
just used is .6745 X 7.47 = 5.04; hence the mean may be 
written 253.64 ± 5.04. There is no corresponding generally 
accepted method for the standard error. 
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The significance of differences 

The reliability or significance of differences is frequently 
important, particularly in educational experimentation. Such 
questions as whether a difference in the achievements of two 
groups wherein different methods were employed indicates a 
real difference or probably is due to chance, whether a single 
pupil’s gain in weight or speed of reading or ability to solve 
problems in arithmetic is significant or not, and so on, often 
arise. Sometimes not merely differences but differences be- 
tween differences are involved. 

The general formula for the standard error of a sum or differ- 
ence is 

± 2ri2«ie2 , 

in which the subscripts “1” and “2” represent the two measures 
the sum or difference of which is in question. If a sum is 
involved, the last term is preceded by a plus sign; if a differ- 
ence, by a minus sign. If there is no correlation between the 
measures concerned, the last term becomes zero, leaving merely 

<1*2 = "^*1 "t" <2* 

All too often, however, workers have employed this short form 
when correlation is present and the long form should be used. 
The effect of so doing is to yield too small values when sums 
are concerned and too large ones when differences are, with con- 
sequent interpretations of too high reliability of sums and too 
low reliability of differences. 

Since, in actual practice, the reliability~of s ums " is desired 
far less often than that of differences, only the latter will be 
illustrated. Let us suppose that we wish to determine the 
significance of the gain in mean ability to solve a specified type 
of arithmetic examples resulting from practice given a group of 
34 pupils, and that their mean score before the practice was 
14.2; that after it, 16.4; the corresponding standard deviations, 
3.1 and 3.8; and the coefficient of correlation between the two 
series of scores, -f .73. First, the standard errors of the means 
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miist be found. Using the subscript “1” to refer to the scores 
before the practice and “2” to refer to those after, we have 

31 j 3.8 

~ -3^ €Af, = —7= = .66. 

* V 33 * ^33 

Therefore 

+ .66* - 2. X .7-rx .54 X .66» = .45. 

The difference between means is 16.4 — 14.2 = 2.2, which 
divided by .45 gives 4.89, the size of the difference in terms of 
its own standard error. This quotient is sometimes called the 
significance ratio. The last column of the standard deviation 
table in the Appendix gives the chances to 1 that the difference 
is real — ^in other words, that the true mean, or mean of the 
universe, after the practice is greater than the mean before the 
practice. For <r or e equal to 4.89, the chances are approxi- 
mately 2,000,000 to 1. 

The formula for the error of the sum or difference of two 
sums or differences is the same as that for a simple sum or 
difference, except that the terms under the radical are those 
for sums or differences rather than for simple measures. Thus, 
employing the subscripts “1” and “2” for two measures of 
which differences are taken and “3” and “4” for two others, 

«(l-2)-(3-4) = 2 "H <3-4 — 2r(l_2)(3_4) ei-2<3-4 . 

This may be expanded and written in terms of standard errors 
of simple measures rather than of differences by substituting 
for ei _2 and € 3-4 their values, which are, of course, 

«i -2 = + 4 — 2 ri 2 ei €2 and 63-4 = v'el + 4 — 2 rutif 2 . 

To illustrate such a case, let us assume that for another 
group, containing 39 pupils, given a different type of practice, 
the means before and after were 13.6 and 15.3; the standard 
deviations, 2.9 and 3.7 ; and the coefficient of correlation, H-.68. 


® Althouj^h the coefficient of correlation given in the text — .73 — is that between 
th^ series of scores, and the formula for cat i - Af, calls for that between the means, 
the former is used because it can be shown that the two are the same. 
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Using the subscript “3” to refer to measures before the practice 
and “4” to those after, we have 

_ 2.9 _ . _ 3.7 „ 

“ Vhs ~ “ VZ8 ~ 

whence 

= V.47* + .602 _ 2. X .68 X .47 X .60 = .44. 

Since there cannot be correlation between two different groups, 

*(Mi— jtfj) — (Ms— jtf4) ~ jfj) + = ■n/.45* + .44* = .63. 

According to the suggestion at the end of the last paragraph, 
the same result can also be obtained thus: 

V.54*+.66*+A7*+.60*-2. X .73 .47 X .60 = .63. 

The difference between differences is (16.4 — 14.2) — 
(15.3 — 13.6) = 2.2 — 1.7 = .5. Dividing this by its standard 
error, .63, we get .79, the difference between differences ex- 
pressed in terms of its standard error. For this value in the 
first column of Table XLIV the corresponding entry in the last 
column is, by interpolation, approximately 3.7; hence there 
are that many chances to 1 that there is a real difference in the 
direction indicated, that is, that the gain by the first type of 
practice is larger than that by the second. 

The name critical ratio has been given to the result secured by 
dividing a difference between means by its probable error. 
Sometimes also 

difference 

C/xt “ • 

fdiff 

and sometimes it is employed to refer to differences between 
other measures than means. The writer recommends that ft 
be used only as 

difference 
PEdv ’ 

but in connection with all differences. 
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The experimental coefficient method 

A method of interpreting the significance of differences that 
has received considerable use is that based upon the experi- 
mental coefficient. This is defined by the formula 

_ difference difference 
2.78^(^ 4l2P^- 

The multiplier in the denominator was so chosen that when 
EC = 1.00 the chances that a difference is significant are 369 
to 1, which McCall, who suggested the EC, called practical 
certainty. This is at variance with the more usual definition 
of practical certainty as 99 chances to 1, or sometimes as 999 
to 1. The present writer strongly objects to the use of any 
so defined critical point and the consequent dichotomous 
classification of differences as definitely either significant or 
not significant, certain or not certain, reliable or not reliable. 
It seems absurd to consider '99 chances to 1 as certainty, but 
not 98 to 1. Instead, the actual chances should be stated in 
each case. Table XLII gives chances of significance for 
various other values of the experimental coefficient. The 

TABLE XLII 

Chances that a Difference is Significant 
Corresponding to Various Values of the 
Kxperimental Coefficient 


Experimental 

Coefficient 

Approximate 

Chances to 1 

.1 

1.6 

.2 

2.5 

.3 

4.0 

.4 

6.6 

.5 

11. 

.6 

20. 

.7 

38. 

.8 

76. 

.9 

160. 

1.0 

370 

1.1 

900 

1.2 

2400 

1.3 

6700 

1.4 

20000 

1.5 

67000 
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expression chances of significance in this connection means just 
the same as in the last section — chances that the difference 
really is in the indicated direction. 

The experimental coefficient method gives just the same 
chances as that presented in the last section, but requires the 
additional step involved by introducing the factor 2.78 or 4.12, 
hence seems to the writer less deserving of use than the other 
method. In the last example of that section, for example. 


EC = 


.5 

2.78 X .63 


= .28. 


For this value. Table XLII yields chances of about 3.7 to 1, 
the same as were already determined. 


Small sample theory 

The most commonly applied small sample theory is that 
which makes use of t instead of e or PE. Originally the use 
of t was limi ted to situations involving differences between 
means and its formula was 

^ ^ M -M 

'tii 

Later, however, it came to be employed in connection with 
other differences also. It may be defined, therefore, as the 
difference between a statistic and the corresponding parameter 
divided by the estimated standard error of the measure con- 
cerned. Its distribution is leptokurtic in shape, although as the 
number of cases becomes large it approaches normality; hence 
it requires a special table for its interpretation in terms of 
probability or significance. For large numbers of cases it 
yields interpretations quite similar to those obtained by large 
sample theory; for very small numbers it yields markedly 
different interpretations. 

Values corresponding to those in the last four columns of 
a <r or € table of the normal curve may be found for t. Since, 
however, they differ according to the number of degrees of 
freedom, they require much more elaborate tables. In default 
of these, condensed tables for t are given in several forms. 
Most of them ^ve values for only certain stated- levels of 
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significance or critical points. The most frequent are chances 
of 20 to 1, often called significant, and of 100 to 1, called very 
significant. This practice is one that the writer deplores. 
Instead, he prefers a table containing entries corresponding to 
values of t and numbers of degrees of freedom. The entries 
may be similar to those in any one of the four columns men- 
tioned above. Probably entries which correspond to those in 
the next-to-the-last colmnn — that is, show the ratio of area 
within a distance of t from the mean to the area farther than 
that — are most convenient. In other words, they show the 
chances to 1 that a statistic does not differ more than t from 
the corresponding parameter. Just as for a normal distribu- 
tion, so here the chances that a statistic is as far or farther away 
from the parameter and in the same direction as the difference 
equals 1 more than twice the chances mentioned in the preced- 
ing sentence. 

Table XLIII has been prepared for use in this connection. 
It gives, for certain values of t and munbers of degrees of 
freedom, the chances to 1 that a statistic does not differ from 
the corresponding parameter by more than the value of t. 
The entries are accurate to two significant figures only and, to 
economize space, are not given when 100 or greater. Its use 
will be illustrated in later paragraphs. 

In actual use t is often employed to find the chance that, if 
an assumed or hypothetical value of a measure is the true one, 
an actually obtained one would be found. For example, if the 
mean of the population of which the previously used distribu- 
tion of 44 cases is a sample is 260, what is the chance that an 
obtained mean would be as far away as is 253.64? 260 — 253.64 
= 6.36, which divided by 8.08, the value already obtained for 
estimated i^, gives t = .79. For t — .79 and 11 degrees of 
freedom the interpolated tabular entry is 1.27. In other 
words, the chances are 1.27 to 1 that an obtained mean will not 
be less than 253.64 if the true mean is 260. 

For comparison, results by large sample theory will be found. 
The formula for the standard error of a difference applies. 
Thus 


fus — y/ (m + 



TABLE XLIII 

Chances to 1 that A Statistic Does Not Diffeb from the Corbesponding Parameter by More than t 
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* The blank spaces represent entries of 100 or more. 
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b- «0 «0 <N 

to o qa 

(M' d 00 00 (N * * 

Tt< o gS <o qa 

i>csiqiddtdi>q>* * 

cO’-t-^»oot>ooqs 

»^io»^deodQdt^t^qs* * 

coeoTf<’«t<totDgat>.Qoqa 

toqa(>id»-4di-Ht>;csdo6i>^i>^* * * * * * 

(N<NeoeO’^-^iOtocDt^b»QOq5 

qi<N^t^q>c4dqacoi>^-Jd»-^i^eoo5to‘c4oa* * * * 

rH(MiMG^(NCOCOCOTf4»tt<tQtOtOtDt-»t>.OOqaqi 

^iododq5rHco'^«doc3ddtoi>dc4»OQO*-5'^tN;^'it* * * * 

^^^^^c^iesiiMesiiMooeococo-rttTyTrTyiOiOiocod 

CO qa »o ^ 

ooooqadi-J^cicocO'tiiioddt'IoooadjiHNco^aSSii’S'SlS 

>1— <^C4C^C^C4^CM04CO>0<000 

to b- qa o C<» CO to 00 O ^ CO. ''IJ «0 !>; oa O csj CO ao «q 00 O to . . . . 

cococoTjt^^^'v^i'i^^totoiotototqtodcddcdddb^ooo^d^ 

xc3ao*-H04eo'^to<et^oooap»HOieo^. to. «>i>.oqoapooopp 

c4c4cocococococoeocoeocO'<4t’^^^'^i^'^'ti5'^<'45todt^o6oap 


* The blank spaces represent entries of 100 or more. 
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but, since M has no error, this becomes 

= -s/el= 

Therefore the difference divided by its standard error equals 

M - M 

which is the same as f or .79. The interpolated entry in the 
next-to-the-last column of the first table in the Appendix for 
€ = .79 is 1.33 and that in the last column is 3.66. These do 
not differ greatly from the corresponding values of 1.27 and 
3.54 found by the use of t. In general, the smaller the values 
of t and of the difference divided by e, the less do the chances 
based upon them differ, and vice versa. 

The procedure based upon t may also be employed to deter- 
mine chances of difference between two obtained means or 
other measures as well as between one obtained and one 
assiuued true one. In this case 


A f 1 - Mi 

ejifj-jifjj ’ 


that is, the difference between the means divided by the stand- 
ard error of that difference. Applying it to the data used on 
page 242, we have 


16.4 - 14.2 
.45 


4 . 89 . 


For this value of t and 33 degrees of freedom the entry that 
would be in Table XLII if it were extended is evidently far 
larger than 100, which indicates practical certainty as to the 
difference being significant. 


Degrees or levels of confidence 

The use of degrees or levels of confidence in stating the 
significance of results has been employed increasingly in recent 
years. Although the present writer does not approve it, he 
will explain it briefly. A degree or level of confidence is the per 
cent of chances that a result is due to chance rather than to 
dependable causes: For example, if there are 90 chances to 10 
in favor of dependability, the level of confidence is 10 per cent; 
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if 97 to 3, it is 3 per cent; and so on. Tables for the inter- 
pretation of sampling errors may give values at given levels of 
confidence, such as the 5 per-cent, the 2 per-cent, and the 1 per- 
cent level. Such tables may then be used, more or less in 
advance of the determination of measures, to indicate their 
sizes for certain degrees of confidence. 

To illustrate the usage just mentioned, we may employ the 
first data used on page 242 in connection with the difference 
of two means. The standard error of that difference, 
was found to be .45. Therefore, if the difference is at the 
5 per-cent level of confidence, it must be as many times its 
standard error as corresponds to chances of 95 to 5, or a ratio 
of 19 to 1. By interpolation, a <r distance of 1.64 is found to 
correspond to an entry of 19 in the last column of the standard 
deviation table in the Appendix. Since 1.64 X .45 = .74, the 
difference between the two means concerned must be .74 in 
order that there be 95 chances to 5, or 19 to 1, that the one 
having the larger obtained value also has the larger true value. 
In other words, the 5 per-cent level of confidence in this situa- 
tion requires a difference of .74. Similarly the 1 per-cent level, 
or 99 chances to 1, requires a difference of 2.32 X .45 = 1.04. 
Another way of stating these facts is that if the difference is .74 
there are 5 chances out of 100 that the larger obtained mean is 
larger because of errors of sampling and that if it is 1.04 there 
is 1 chance out of 100 that this is true. 

The null hypothesis 

Sometimes the approach is through what is known as the 
null hypothesis. This was originally employed to designate any 
exact hypothesis which one may wish to disprove, but is fre- 
quently limited in use to situations in which the hypothesis is 
that the parameter in question is zero. If this hypothesis is 
rejected, the statistic is significant; that is, it signifies that the 
parameter is not zero. In this connection the expression level 
of significance is employed in the same way as level of confidence 
in the other approach. 

Situations in which the null hypothesis is appropriate are 
those in which the point at issue is the possibility that param- 
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eters are zero rather than the limits within which they lie. In 
such an instance as that used in the preceding section and pre- 
viously, and others wherein differences are concerned, this 
condition is likely to hold. Interest centers in whether or not 
the larger obtained measure signifies the larger true measure 
rather than in the probable limits of the latter. In other 
words, the direction rather than the size of the difference is 
important. This is what was determined in the preceding 
section. 


Errors of statistics from matched samples 

Not infrequently groups of pupils employed in experimental 
studies are matched. When this is done, the effect is the same 
as if an element of correlation were introduced; therefore, 
formulas for errors based on data secured from such groups 
must take account of this fact. Instead of u sing th e ordinary 
error formulas unchanged, the multiplier Vl — r* should be 
introduced if the g roups are matched on a fallible or unreliable 
criterion; Vl — r, if they are matched on an infallible or true 
criterion. In either case r is the coefficient of correlation be- 
tween the matching element for the successive groups and is 
found by assuming that each pair or more of matched cases, 
one from each of two or more groups, may be considered as 
scores made by the same individual. Means are more fre- 
quently concerned than other statistics, but the application of 
the procedure is not limited to them. 


The reliability of coefficients of correlation 

Although the reliability of the coefficient of correlation is 
generally estimated by comparing it with its standard or 
probable error as found by the formula in Table XLI, this 
practice is less valid for r than for most of the other measures 
for which error formulas are given in the same table. For low 
values of r from large samples the results are approximately 
correct, but when r is large or the sample small they are liable 
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to be seriously inaccurate. A better formula for ^ than 


is 


1 - 1 - 

Vn 


LuT"- a . l lr^eor, \ 
V'N^i V 4(A - 1)/ 


in which r«», is the value given by the second formula on 
page 133. If r = .50 and N = 20, 


which is slightly larger than the .17 which the formula from 
Table XLI gives. 

A different approach sometimes useful is by means of Table 
XLIII. If t is found by the formula 


t = 



the corresponding entry in Table XLIII is the probability that 
if the true correlation is zero a value as large as the absolute 
value of the obtained r, or larger, would be secured by pure 
chance. For example, if r = .50 for 20 cases. 


= ^-^20 


- 2 ) 

50 * 


2.45. 


The interpolated entry in Table XLIII corresponding to 
t = 2.45 and 18 degrees of freedom is approximately 40. In 
other words, the chances are 40 to 1 that a coefficient exceed- 
ing d=.50 would not be secured if the true value of r were .00. 
The chances against a value of +.50 or more are 2 X 40 + 1, 
or 81, to 1. 

Except when N is quite small, the formula 

VA-l’ 

which is that for the standard error of r when true r = .00, 
may be used for the same purpose. In this case it gives 

1 
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Dividing r, .50, by this gives 2.18, the value of r in terms of its 
standard error. Using this as a first-coliimn entry in the first 
table in the Appendix, we find a corresponding last-column 
entry of about 69, the chances to 1 that a value of -1-.50 or 
more would not be obtained. The discrepancy between 69 and 
81 is due largely to there being only 20 cases concerned. 

More elaborate and exact procedures for estimating the 
reliability of a coefficient of correlation have been proposed, 
but do not appear appropriate for presentation in an elemen- 
tary text. Probably the best involves changing r into another 
statistic, z, which has an approximately normal distribution 
although r does not. 

EXERCISES AND PROBLEMS 

Since the errors of almost all the measures called for in the exercises 
and problems at the ends of previous chapters can be found, similar 
computations will not be asked for here. It is recommended that the 
standard and probable errors of some of each type of measure for 
which error formulas are available be found, both by the formulas in 
Table XLI and with the appropriate corrections and modifications 
thereto suggested in the text. The exercises below call for certain 
applications of these formulas and uses of the procedures presented 
later in the chapter. 

1. If M = 48.6, <r = 5.4, and N = 65, what is the chance that M is 

(а) within 2 points of M? 

(б) within 10 points of Af? 

(c) more than 5 points from M? 

(d) at least 3 points larger than M? 

(e) at least 8 points smaller than M? 

2. If Qi = 64.0, Qs = 82.4, and N = 230, what is the chance that 

(o) Qi and Q» differ from their respective parameters by less 

than 4 points? 

(5) Qi and Qi differ from their respective parameters by more 
than 6 points? 

(c) Qi is at least 3 points more than Qi? 

(d) Qi is at least 8 points less than Qs? 

(e) Q is within 2 points of Q? 
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3 , If an experimental group of 33 pupils has a mean score of 74 
on an initial test and of 97 on a final test, with accompanying standard 
deviations of 10.2 and 13.6, respectively, and a coefficient of correla- 
tion of .72; and a control group of the same size has corresponding 
means of 72 and 92, with standard deviations of 9.6 and 12.8, and a 
coefficient of .76; what is the chance that the greater gain of the ex- 
perimental group is significant? 

4 . If the following data were obtained from an experiment, what 
is the chance that the difference in gains is significant? 

Initial Final 



N 

Md 

Q 

Md 

Q 

r 

Experimental 

40 

35 

6.3 

64 

8.7 

.68 

Control 

133 

38 

7.2 

62 

8.8 

.73 


6, Find the chances for the data in Exercises 3 and 4 by the ex- 
perimental coefficient method. 

6. Find the chances for the data in Exercise 1 if JV is 12. 

7 . Find the chances for the data in Exercise 2 if iV* is 16. 

8. Find the chances for the data in Exercises 3 and 4 by small 
sample procedure if both values of N in Exercise 3 are 10 and if its 
values in Exercise 4 are 8 and 15, respectively. 

9. Find the chances for the data in Exercise 3 if the data are 
matched 

(а) on a fallible basis with r = .84. 

(б) on an infallible basis with r = .96. 

10 . If r = .71 for a sample of 85 cases, what is the chance that it 

(а) differs from r by more than .10? 

(б) differs from r by more than .04? 

(c) differs from r by less than .02 ? 

(d) is at least .03 smaller than f ? 

(e) is at least .06 larger than r? 

11 . If r = .32 and N = 17, what are the chances that r is not .00? 

12 . If r = — .27 and N = 40, what are the chances that r is not .00? 
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Tables of the Normal Curve 

Tables of the normal curve and its functions may be pre- 
sented in various forms, but that in which other functions are 
tabled with respect to the standard or median deviation is 
most frequently useful; therefore it is followed here. More 
detailed tables in this form and others organized with respect 
to area, height, or some other functions may be foimd in some 
of the references given in Chapter I and elsewhere. 

The first of the two tables presented here gives the heights, 
included and excluded areas, and two ratios of areas, corre- 
sponding to each value of the standard deviation or error from 
.0 up to 5.0, by intervals of .1; the second does the same for 
the median deviation or probable error from .0 up to 6.0. The 
values of the two deviations just referred to are in the first 
columns of the tables. 

The second columns contain the heights, h, or ordinates of 
the normal curve at the given deviation distances. These are 
given in term of a maximum height, or central ordinate, of 
1.00, for which the area under the curve is or 2.5066. 
Some tables of the same function present them in terms of an 

area of 1.00, for which the maximum height is or .3989. 

When so presented, the symbol z is commonly employed for 
them. Just as for the maximum height, so for any other, 
z = .3989/i, hence can readily be obtained from the h entries. 

Each entry in the third colximn is the fraction of the whole 
area under the normal curve included between an ordinate at 
the corresponding distance and the maximum ordinate, which, 
of course, is at the mean. Each entry in the fourth column is 
the fraction of the area beyond the same ordinate, or excluded 
by it. The sum of the two is always .5000. 

S57 
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Each jfifth-column entry is obtained by dividing the corre- 
sponding fourth-column one into that in the third colmnn. In 
other words, it is the ratio of the area closer to the mean than 
the corresponding distance in the first column to the area 
farther from the mean. In appropriate situations this entry 
represents the chances to 1 that a single case is within rather 
than without an ordinate at the corresponding distance. 

Finally, each entry in the last column is 1 minus the entry 
in the fourth divided by that in the fourth. It is also equal to 
one more than twice the fifth-column entry. It is the ratio of 
the area of the larger of the two parts into which the ordinate 
divides the whole area under the curve to that of the smaller. 
Therefore it is the chances to 1 that a given case is not beyond 
the corresponding distance in one, but not in both, directions. 
In other words, it is the same as the chances arrived at by the 
experimental coefficient procedure. 

As an example, the row of entries beginning with a a distance 
of 1.5 in the first table may be used. The height of the curve, 
or length of the ordinate, 1.5<r in each direction from the mean 
is .3247 of its height at the mean, that is, of the maximum 
ordinate. If z rather than h is desired, it is .3989 X .3247 = 
.1295. The area included between an ordinate at ±1.5(r and 
the mean is .4332 of the total area and that beyond ±1.5<r is 
.0668 of the total area. If the area between and -f 1.5<r 

is desired, it can be obtained by doubling .4332, the result 
being .8664; if the area farther than 1.5<r from the mean in 
both directions is wanted, .0668 should be doubled, giving 
.1336. The ratio of .4332 to .0668, or of .8664 to .1336, is 
approximately 6.5, which means that 6.5 times as much area 
is within 1.5<r as is beyond it. In other words, the chances are 
6.5 to 1 that any single case is within a distance of 1.5(r from 

the mean. Finally, or 2 X 6.5 + 1 = 14.0, the ratio 

.Uddo 

of all the area on the larger side of an ordinate at ±1.5(r from 
the mean to all that on the smaller side. This may be defined 
also as the chances to 1 that a single case lies below 1.5<r 
above the mean or above 1.5or below the mean rather than the 
opposite. 
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TABLE XLIV 


Heights, Areas, and Corresponding Ratios or Areas of a Normal Dis> 
TRiBUTioN Corresponding to Standard Deviation or Error Units 


<r or 6 
Distance 
from 
Mean 

K 

Height or 
Ordinate at 
Given <r or e 
Distance 

Area Between 
Ordinate at 
Given a or e 
Distance 
and Mean 

Area Beyond 
Ordinate at 
Given a or e 
Distance 
from Mean 

Ratio of Area 
Between 
Ordinate 
and Mean to 
Area Beyond 
Ordinate 

Ratio of Area 
on Larger 
Side of 
Ordinate 
to Area on 
Smaller Side 

.0 

1.0000 

.0000 

.5000 

.00 

1.00 

.1 

.9950 

.0398 

.4602 

.09 

1.17 

.2 

.9802 

.0793 

.4207 

.19 

1.38 

.3 

.9560 

.1179 

.3821 

.31 

1.62 

.4 

.9231 

.1554 

.3446 

.45 

1.90 

.5 

.8825 

.1915 

.3085 

.62 

2.24 

.6 

.8353 

.2257 

.2743 

.82 

2.65 

.7 

.7827 

.2580 

.2420 

1.07 

3.13 

.8 

.7261 

.2881 

.2119 

1.36 

3.72 

.9 

.6670 

.3159 

.1841 

1.72 

4.43 

1.0 

.6065 

.3413 

.1587 

2.1 

5.3 

1.1 

.5461 

.3643 

.1357 

2.7 

6.4 

1.2 

.4868 

.3849 

.1151 

3.3 

7.7 

1.3 

.4296 

.4032 

.0968 

4.2 

9.3 

1.4 

.3753 

.4192 

.0808 

5.2 

11.4 

1.5 

.3247 

.4332 

.0668 

6.5 

14.0 

1.6 

.2780 

.4452 

.0548 

8.1 

17.2 

1.7 

.2357 

.4554 

.0446 

10.2 

21.4 

1.8 

.1979 

.4641 

.0359 

12.9 

26.8 

1.9 

.1645 

.4713 

.0287 

16.4 

33.8 

2.0 

.1353 

.1772 

.0228 

21. 

43. 

2.1 

.1103 

.4821 

.0179 

27. 

55. 

2.2 

.0889 

.4861 

.0139 

35. 

71. 

2.3 

.0710 

.4893 

.0107 

46. 

92. 

2.4 

.0561 

.4918 

.0082 

60. 

121. 

2.5 

.0439 

.4938 

.0062 

80. 

160. 

2.6 

.0340 

.49534 

.00466 

106. 

214. 

2.7 

.0261 

.49653 

.00347 

143. 

287. 

2.8 

.0198 

.19744 

.00256 

195. 

390. 

2.9 

.0149 

.49813 

.00187 

267. 

535. 

3.0 

.0111 

.49865 

.00135 

369. 

739. 

3.1 

.00819 

.49903 

.00097 

520 

1030 

3.2 

.00598 

.49931 

.00069 

730 

1450 

3.3 

.00433 

.499517 

.000483 

1030 

2070 

3.4 

.00309 

.499663 

.000337 

1480 

2970 

3.5 

.00219 

.499767 

.000233 

2150 

4300 

3.6 

.00153 

.499841 

.000159 

3100 

6300 

3.7 

.00107 

.499892 

000108 

4600 

9300 

3.8 

.00073 

.499928 

.000072 

6900 

11300 

3.9 

.00050 

.4999519 

.0000481 

10400 

20800 

4.0 

.00034 

.4999683 

.0000317 

15800 

31500 

4.1 

.000224 

.4999793 

.0000207 

24200 

48300 
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TABLE XLIV— (Conc/wded) 

Heights, Areas, and Corresponding Ratios op Areas op a Normal Dis- 
tribution Corresponding to Standard Deviation or Error Units 


O' or € 
Distance 
from 
Mean 

h, 

Height or 
Ordinate at 
Given o' or e 
Distance 

Area Between 
Ordinate at 
Given <r or « 
Distance 
and Mean 

Area Beyond 
Ordinate at 
Given <r or « 
Distance 
from Mean 

Ratio of Area 
Between 
Ordinate 
and Mean to 
Area Beyond 
Ordinate 

PAtio of Area 
on Larger 
Side of 
Ordinate 
to Area on 
Smaller Side 

4.2 

.000148 

.4999867 

.0000133 

37600 

75200 

43 

.000097 

.4999915 

.0000085 

59000 

118000 

4.4 

.000062 

.4999946 

.0000054 

93000 

185000 

4.5 

.000040 

.49999660 

.00000340 

147000 

294000 

4.6 

.000025 

.49999789 

.00000211 

237000 

473000 

4.7 

.000016 

.49999870 

.00000130 

384000 

769000 

4.8 

.000010 

.49999921 

.00000079 

630000 

1260000 

4.9 

.000006 

.49999952 

.00000048 

1040000 

2090000 

5.0 

.000004 

.49999971 

.00000029 

1740000 

3490000 
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TABLE XLV 

Heights, Areas, and Corresponding Ratios op Areas op a Normal Dis- 
tribution Corresponding to Median Deviation or 
Probable Error Units 


MdD 
or PE 
Distance 
from 
Mean 



K 

Height or 
Ordinate at 
Given MdD 
or PE 
Distance 

Area Between 
Ordinate at 
Given MdD 
or PE 
Distance 
and Mean 

1.000) 

.0000 

.9977 

.0269 

.9909 

.0537 

.9797 

.0802 

.9643 

.1063 

.9447 

.1320 

.9214 

.1571 

.8945 

.1816 

.8645 

.2053 

.8317 

.2281 

.7965 

.2500 

.7594 

.2709 

.7207 

.2909 

.6808 

.3097 
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TABLE XIN--C(mdvded 

Heights, Areas, and Correspondino Ratios of Areas of a Normal Dis- 
tribution CORRESPONDINa TO MEDIAN DEVIATION OR 
Probable Error Units 


MdD 
ox PE 
Distance 
from 
Mean 

K 

Area Between 

Height or 
Ordinate at 
Given MdD 
or PE 
Distance 

Ordinate at 
Given MdD 
or PE 
Distance 
and Mean 

4.2 

.0181 

.49769 

4.3 

.0149 

.49814 

4.4 

.0122 

.49850 

4.5 

.01000 

.49880 

4.6 

.00812 

.499041 

4.7 

.00657 

499238 

4.8 

.00530 

.^99397 

4.9 

.00425 

.499525 

5.0 

.00339 

.499627 

5.1 

.00269 

.499709 

5.2 

.00213 

.499774 

5.3 

.00168 

.499825 

5.4 

.00132 

.499865 

5.5 

.00103 

.499896 

5.6 

.000798 

.4999207 

5.7 

.000617 

.4999396 

5.8 

.000478 

.4999542 

5.9 

.000364 

1 .4999655 

6.0 

.000278 

.4999741 



.00120 

.000969 

.0C0762 

.000603 

.000475 

.000373 

.000291 

.000226 

.000175 

.000135 

.000104 

.0000793 

.0000604 

.0000458 

.0000345 

.0000259 






Index 


a, Symbol for frequency in upper left cell 
of Four-fold table. 

a (as subscript), Symbol for class above 
that in which assumed measure of cen- 
tral tendency is taken. 

Abilities derived from statistical study, 2. 
Abscissa, 33. 

Absolute accuracy, 7. 

Absolute error, 7. 

Absolute range, 83. 

Accidental error, 9. 

Accumulative error, 9. 

Accumulative graph, 40. 

Accuracy, 6; also see various Errors and 
Reliability. 

AD, Symbol for Average deviation. 
Adding machines, 4. 

Aids to computation, 4. 

Alienation, coefficient of, 142. 

Allokurtic curve, 50. 

a (alpha) , Symbol for fraction of cases in 
upper left cell of Four-fold table. 
Analysis of co- variance, 93. 

Analysis of variance, 93. 

Apparent mode, 63. 

Area under normal curve, 199, 257. 
Arithmetic average, 55. 

Arithmetic mean, 55. 

Assigning marks, 199. 

Assigning ranks, 160. 

AssM, Symbol for Assumed mean. 
Assumed location of grouped cases, 24. 
Assumed mean, 56, 108. 

Assumption of symmetrical distribution 
within a class, 24. 

Asymmetrical curve, 50. 

Attenuation, 153. 

Attribute, 16. 

Average, 55. 

Average deviation, 85. 

Average, error of an, 9. 

Average variation. Same as Mean devia- 
tion. 

Averaging coefficients of correlation, 126. 
Averaging qualitative data, 203. 

Axes, 32. 

Ayres’s method of computing coefficients 
of correlation, 108. 

B, Symbol for Base for ratio score. 


5, Symbol for Coefficient of regression; 
also for frequency in upper right cell 
of Four-fold table. 

6 (as subscript), Symbol for class below 
that in which assumed measure of cen- 
tral tendency is taken. 

Barlow, Peter, 4. 

Base for ratio score, 223. 

Best fitting distribution, 207. 

(beta). Symbol for fraction of cases in 
upi)er right cell of Four-fold table; 
also for Regression coefficient, when 
Standard scores are used. 

Biased error, 9. 

Bi-modal distribution, 65. 

Binomial expansion, 47. 

Biologic curve, 46. 

Bi-scrial correlation, 75, 185. 

Blakeman’s criterion, 182. 

Boundary, 20. 

Broom, M. E., 13. 

C, Symbol for Coefficient of contingency. 

c, Symbol for Correction; also fo"^ fre- 
quency in lower left cell of Four-fold 
table. 

Calculating machines, 5. 

Caption, 114. 

Card-punching and automatic sorting 
machines, 6. 

CC, Symbol for Coefficient of contingency. 

Centilc, Same as Percentile. 

Central tendency, 55. 

Certainty of prediction, 142. 

Chance error, 9. 

Chance of significance, 246. 

Changing -ile ranks to ordinary ranks, 77. 

Changing qualitative data into a normal 
distribution, 203. 

Changing ranks to percentiles, 77. 

Changing rank to product-moment co- 
efficients of correlation, 164. 

Chesire, Leona, 192. 

X* (chi-square), Symbol for summation, in 
Test for goodness of fit. 

Class, 20. 

Class boundaries, 20. 

Classes, number of, 21. 

Classical sampling formulas, 232. 

Classifying data, 16. 
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INDEX 


Class index, 24. 

Class interval, 20. 

Class limits, 19, 22, 27. 

Class mark, 24. 

Class midpoint, 21, 24, 27. 

Class type, 24. 

Class width, 20. 

Coefficient of alienation, 142. 

Coefficient of bi-serial correlation, 185. 
Coefficient of contingency, 188, 195, 215, 
Coefficient of correlation, 105, 252. 
Coefficient of curvilinear correlation, 179. 
Coefficient of dependability, 142. 
Coefficient of determination, 146. 
Coefficient of mean square contingency, 
188. 

Coefficient of multiple correlation, 170. 
Coefficient of multiple regression, 174. 
Coefficient of net correlation, 167. 
Coefficient of net regression, 173. 
Coefficient of non-determination, 146. 
Coefficient of partial correlation, 167. 
Coefficient of partial regression, 173. 
Coefficient of rank correlation, 161. 
Coefficient of regression, 137. 

Coefficient of reliability, 92, 150. 
Coefficient of self-correlation, 150. 
Coefficient of variability, 100. 

Coefficient of variation, 100. 

C of L, Symbol for Criterion of linearity. 
C of V, Symbol for Coefficient of vari- 
ability. 

Column diagram, 34. 

Column heading, 114. 

Common factor in correlation, 130. 
Comparable measures, 222. 

Comparing an actual with an ideal dis- 
tribution, 207. 

Comparing coefficients of correlation, 127. 
Compensating error, 9. 

Competent statistical worker, 3. 
Computational aids, 4. 

Computed mode, 64. 

Computing machines, 5. 

Computing tables, 4. 

Confidence, 250. 

Constant error, 9. 

Construction of normal curve, 47. 
Contingency, coefficient of, 188, 195, 215. 
Continuous variable, IJ* 

Cooke, Dennis H., 13. 

Cobrdinate axis system, 32. 

Correction, 57. 

Correction for attenuation, 153. 
Correction of bi-serial coefficient of corre- 
lation, 187. 

Correction of coefficient of contingency 
191. 

Correction of coefficient of correlation, 
127, 133. 


Correction of coefficient of multiple 
correlation, 172. 

Correction of iNT, 238. 

Correction of phi coefficient of correla- 
tion, 195. 

Correction of ratio of correlation, 183. 
Correction of standard deviation, 91. 
Correlation, 104, 137, 167, 179. 
Correlation charts, 121. 

Correlation of attributes, 183. 

Correlation ratio, 179. 

Correlation table, 114. 

Co-variance, 93. 

Cowden, Dudley J., 255. 

CR, Symbol for Critical ratio. 

Crawford, Claude C., 13. 

Criterion of linearity, 182. 

Critical ratio, 244. 

Croxton, Frederick E., 255. 

Crude mode, 63. 

Culler’s interpretation of P, 211. 
Cumulative curve, 40. 

Cumulative distribution, 28. 

Cumulative error, 9. 

Cumulative frequency, 28. 

Cumulative graph, 40. 

Cumulative percent, 28. 

Cumulative table, 28. 

^ (curl). Symbol for Parameter, 

Curve fitting, 51. 

Curve of error, 46. 

Curve of organic decay, 51. 

Curve of organic growth, 51. 

Curve of probability, 46. 

Curve of regression, 137. 

Curvilinear correlation, 179. 

Curvilinear relationship, 179. 

D, Symbol for Difference in rank. 

Dio. BO, Symbol for 10-90 percentile range, 
d. Symbol for Deviation; also for fre- 
quency in lower right cell of Four-fold 
table. 

Dec, Symbol for Decile. 

Decile, 71. 

Decimal places, dropping, 11. 

Decimal places, number of, 11. 

Degree of confidence, 250. 

Degrees of freedom, 211, 215. 

5 (delta). Symbol for fraction of cases in 
lower right cell of Four-fold table. 
Departure, 83. 

Dependability, coefficient of, 142. 
Dependent variable, 141. 

Descriptive statistics, 227. 
Determination, coefficient of, 146. 
Deviation, 57, 83. 

Difference, reliability of, 242. 

Difficulty of an clement, 218. 

Direct correlation, Same as Positive 
correlation. 
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Discontinuous variable, 17. 

Discrepancy, 83. 

Discrete variable, 17. 

Dispersion, 83. 

Displacement associated with coefficient 
of correlation, 130. 

Distribution, 19. 

Distribution of marks, 199. 

Distribution within class, 24. 

Doolittle method of multiple correlation, 
170. 

Double entry table, 114. 

Double frequency table, 114. 

Dropping decimal places, 11. 

Dunlap, Jack W., 5. 

Et Symbol for Coefficient of depend- 
ability. 

Eg^ Symbol for Coefficient of depend- 
ability based on pure chance. 

Emt Symbol for Coefficient of depend- 
ability based on mean, 
c. Symbol for base of Napierian logarithms, 
2.7183. 

ECf Symbol for Experimental coefficient. 
Edgerton, Harold A., 12. 

Efficiency of prediction, 142. 

Element, difficulty of an, 218. 

Element validity, 187. 

Empirical mode, 63. 

Engelhart, Max D., 13, 15, 31, 70, 81, 
103, 136, 149, 159, 178, 226, 256. 
Enlow’s slide rule, 6. 
c (epsilon) , Symbol for Standard error, 
test, Symbol for Standard error of esti- 
mate. 

cmoas. Symbol for Standard error of 
measurement. 

€ 1 - 2 , Symbol for Standard error of estimate. 
«i*oo. Symbol for Standard error of meas- 
urement. 

Equally noticed differences, 219. 
Equation of multiple regression, 174. 
Equation of partial regression, 173. 
Equation of regression, 137. 

Equivalent measures, 222. 

Equivalent score equations, 222. 

Error curve, 46. 

Error of a difference, 242, 

Error of an average, 9. 

Error of a power, 10. 

Error of a product, 9. 

Error of a quotient, 9. 

Error of a root, 10. 

Error of a score, 155. 

Error of a sum, 8. 

Error of estimate, 140, 143. 

Error of matched samples, 252. 

Error of measurement, 155, 238. 

Error of observation, 155. 

Error of prediction, 140. 


Error of sampling, 228. 

Error, systematic, 9. 

Error, variable, 9. 

Estimate, error of, 140, 143. 

Estimated mode, 64. 

Estimated true score, 147. 
i; (eta). Symbol for Ratio of correlation. 
Exercises and problems, 14, 30, 52, 68, 
79, 103, 134, 147, 158, 166, 177, 195, 
224, 254. 

Experimental coefficient, 245. 

Ezekiel, Mordecai, 14, 136, 148, 177, 197, 
256. 

/, Symbol for Frequency. 

Face value, 24. 

Fallible score, 147. 

First order coefficient of correlation, 168. 
First quartile, 71. 

Fisher, R. A., 14, 15, 53, 136, 178, 226, 
229, 256. 

Fit, goodness of, 182, 210. 

Fluctuation, 83. 

“Footrule” correlation, 161. 

Four-fold table, 192. 

Freedom, degrees of, 211, 215. 

Frequency, 19. 

Frequency curve, 34. 

Frequency distribution, 19. 

Frequency graph, 34. 

Frequency polygon, 36. 

<?, Symbol for Geometric mean, 
flf, Symbol for Gain in rank; aUo for 
chance element connected with Co- 
efficient of correlation. 

Gain in rank, 162. 

y (gamma) , Symbol for fraction of cases 
in lower left cell of Four-fold table. 
Garrett, H. E., 13, 15, 53, 70, 80, 103, 
136, 148, 159, 166, 178, 197, 226, 256. 
Gaussian curve, 46. 

General suggestions as to procedure, 2. 
Geometric mean, 65. 

GM, Symbol for Geometric mean. 
Gompertz curve, 51. 

Good, Carter V., 12. 

Goodness of fit, 182, 210. 

Graduated curve, 38. 

Graduating, method of, 38. 

Graphing, 32. 

Gray, Clarence T., 13. 

Group, 20. 

Group error, 9. 

Grouping data, 16. 

Guessed mean, 57. 

Guilford. J. P., 13, 15, 31, 54, 70, 80, 103, 
136, 148, 159, 166, 178, 198, 226, 256. 

Symbol for Harmonic mean. 
h. Symbol for Height of normal curve 
when Maximum ordinate is 1.00. 
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Hannonio mean, 67. 

Height of normal curve, 47, 257. 
Histogram, 34. 

HMt Symbol for Harmonic mean. 
Holsinger, Karl J., 5, 14. 

i, Symbol for Interval. 

/p, Symbol for Index of prediction. 

-Ue points, 71. 

Increase, rate of, 66. 

Independence table, 211. 

Independent variable, 141. 

Index of forecasting efficiency. Same as 
Coefficient of dependability. 

Index of prediction, 142. 

Index of reliability, 155. 

Indirect correlation. Same ae Negative 
correlation. 

Individual error, 9. 

00 (infinity) (as subscript). Symbol for 
True score. 

Inspectional mode, 63. 

Intercorrelation, 124. 

Interpretation of coefficients of correla- 
tion, 129, 142. 

Interval, 20. 

Inverse correlation. Same as Negative 
correlation. 

Isokurtic curve, 40. 

Item validity, 185. 

Jackson, Robert W., 133. 

J-curve, 51. 

Judges, number of, 221. 

Judges’ ratings, reliability ’of, 221. 

kt Symbol for Coefficient of alienation. 
fc». Symbol for Coefficient of non-determi- 
nation. 

Kelley, Truman Lee, 5, 14. 

Kenney, John F., 14, 15, 31, 54, 70, 103, 
136, 148, 166, 178, 198, 226, 256. 

KUt Symbol for Kurtosis. 

Kurtosis, 49, 102. 

Kurtz, Albert K., 5, 12. 

f. Symbol for Lower limit. 

Laplacean curve, 46. 

Large sample theory, 230. 

Least squares, method of, 141. 

Length and reliability, 151. 

Leptokurtic curve, 49. 

Level of confidence, 250. 

Limitations upon coefficients of correla- 
tion, 126. 

Limit of accuracy, 8. 

Lindquist, E. F., 13, 15, 54, 70, 81, 103, 
136, 148, 159, 166, 178, 198, 226, 229, 
256. 

Linearity, criterion of, 182. 

Linear relationship, 105. 


Line of best fit, 141. 

Line of regression, 138. 

If, Symbol for Lower limit. 

Location of grouped oases, 24. 
Logarithms, 11, 66. 

Long method of computing mean, 56. 
Lower limit, 19. 

Lower quartile, 71. 

M, Symbol for Mean. 

Mot Symbol for Geometric mean. 

MHt Symbol for Harmonic mean. 

Major mode, 65. 

Marks, distribution of, 199. 

Matched samples, errors of, 252. 
Maximum ordinate, 47. 

McCall, W. A., 219. 

MDt Symbol for Mean deviation. 

Mdf Symbol for Median. 

MdDf Symbol for Median deviation. 

Mdn, Symbol for Median. 

MEt Symbol for Mean error. 

Mean, 55, 67. 

Mean absolute deviation. Same as Mean 
deviation. 

Mean deviation, 85, 94. 

Mean error, 232. 

Mean, geometric, 65. 

Meaning of numerical data, 26. 

Mean square deviation, 93. 

Measurement, error of, 155, 238. 
Measures of central tendency, 55. 

Medf Symbol for Median. 

Median, 59, 68. 

Median deviation, 93, 94. 

Mesokurtic curve, 49. 

Method of attributes, 16. 

Method of least squares, 141. 

Method of smoothing, 38. 

Method of variables, 16. 

Midmeasure, 63. 

Midpoint of a class, 21, 24, 27. 

Midscore, 63. 

Minor mode, 65. 

Mo, Symbol for Mode. 

Mode, 63, 68, 136. 

Mode, Elmer B., 14, 15, 31, 54, 70, 81, 
103, 148, 166, 178, 198, 226, 256. 
Monroe, Walter S., 12, 13, 15, 31, 70, 81, 
103, 136, 149, 159, 178, 198, 226, 256. 
Moving averages, method of, 38. 
Multi-modal distribution, 65. 

Multiple correlation, 170. 

Multiple regression, 174. 

Multiple regression coefficients, 175. 

iV, Symbol for Total number of cases, 
n. Symbol for Number of classes; also for 
Total number of classes; also /or Num- 
ber of variables. 

Negative correlation, 104. 
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Negative skewness, 51. 

Net correlation, 167. 

Net regression, 173. 

Nomogram, Same as Nomograph. 
Nomograph, 6. 

Non-componsating error, 9. 
Non-cumulative error, 9. 
Non-determination, coefficient of, 146. 
Non-linear relationship. Same aa Curvi- 
linear relationship. 

Non-quantitative classification, 16. 
Non-symmetrical curve, 60. 

Normal curve, 46, 61, 94, 199, 267. 
Normal frequency curve, 46, 61, 04, 199, 
267. 

Normal ogive, 61. 

Normal surface, division of, 199, 267. 

Null hypothesis, 261. 

Number of classes, 21. 

Number of judges, 220. 

Numerical classification, 16. 

Numerical data, meaning of, 26. 

O, Symbol for Origin. 

Observation, error of, 166. 

Obtained score, 147. 

Ogive, 42. 

Ordered classification, 16. 

Order of coefficients of correlation, 108. ' 
Ordinate, 33. 

Organic decay, curve of, 61. 

Organic growth, curve of, 51. 
Over-refinement of procedures, 3. 
Overlapping, 74. 

P, Symbol for Percentile; aUo for Proba- 
bility. 

p. Symbol for numerator of formula for 
Product-moment correlation; also for 
larger of two fractions into which a 
whole is divided. 

Parabola, 51. 

Parameter, 227. 

Part correlation, 170. 

Partial correlation, 167. 

Partial regression, 173. 

Partial regression coefficients, 176. 

Partial standard deviation, 168. 

Partial sum, 60. 

PE, Symbol for Probable error. 

PEesu Symbol for Probable error of 
estimate. 

PEmeast Symbol for Probable error of 
measurement. 

PEi-i, Symbol for Probable error of 
estimate. 

PEi.oot Symbol for Probable error of 
measurement. 

Pearson, Karl, 6, 61, 182. 

Pearsonian correlation. Same aa Product' 
QQoment correlation. 


Per, Symbol for Percentile. 

Percentile, 42, 71. 

Percentile curve, 42. 

Percentile rank, 76, 223. 

Percentile score, 75, 223. 

Peters, Charles C., 14, 15, 64, 70, 103, 
136, 149, 159, 166, 178, 198, 211, 226, 
256. 

<f> (phi). Symbol for Phi coefficient of 
correlation. 

Phi coefficient of correlation, 194. 

Phillips, Alexander J., 133. 

V (pi). Symbol for ratio of circumference 
of circle to diameter, 3.1416. 

Planning statistical work, 3. 

Platykurtic curve, 49. 

Point measures, 65, 71. 

Population, 227. 

Positive correlation, 104. 

Positive skewness, 61. 

Power, error of a, 10. 

PR, Symbol for Percentile rank. 

Practical certainty, 245. 

Prediction, error of, 140. 

Prediction, index of, 142. 

Probability curve, 46. 

Probable deviation, 93. 

Probable error, 94, 143, 165, 232. 

Product, error of a, 9. 

Product-moment correlation, 105, 191. 
Prophecy formula, 151. 

Purpose of book, 1. 

Q, Symbol for Quartile deviation. 

q, Symbol for smaller of two fractions into 
which a whole is divided. 

(?i. Symbol for First quartile. 

Qz, Symbol for Third quartile. 

QD, Symbol for Quartile deviation. 
Quadrant, 32. 

Quadratic equation, solution of, 194. 
Qualitative classification, 16. 

Quantitative classification, 16. 

Quartile, 71. 

Quartile deviation, 84, 94. 

Quintile, 71. 

Quotient, error of a, 9. 

R, Symbol for Rank; also for “Footrule” 
coefficient of rank correlation. 

Symbol for Coefficient of multiple 
correlation. 

Symbol for Coefficient of multi- 
ple correlation. 

r. Symbol for Coefficient of product- 
moment correlation. 

r*. Symbol for Coefficient of determi- 
nation. 

TMs^ Symbol for [Bi-serial coefficient of 
correlation. 

rn» Symbol for Coefficient of reliability of 
test n times as long. 
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rtet, Synibol for Tetrachoric coefficient of 
correlation. 

rii, Symbol for Coefficient of reliability. 
Vni, Symbol for Index of reliability. 
nM 4 ...rt, Symbol for Coefficient of partial 
correlation. 

Random error, 9. 

Random numbers, 229. 

Random sampling, 228. 

Range, 83, 94. 

Range of talent, effect of, 127. 

Rank correlation, 160. 

Ranking, ICO. 

Ranks, 75, 223. 

Rate of change, 66. 

Rate of increase, 66. 

Ratio of correlation, 179. 

Ratio of significance, 243. 

Ratio scores, 223. 

Rectangular curve, 50. 

Rectilinear relationship, 105. 

References, 12, 15, 31, 53, 70, 80, 103, 
136, 148, 159, 166, 178, 226, 255. 
Refined mode, 64. 

Regression, 137, 167, 173. 

Regression coefficient, 137, 173. 
Regression curve, 137. 

Regression equation, 137, 175. 

Regression line, 138. 

Relationship, measurement of, 104. 
Relative accuracy, 7. 

Reliability, coefficient of, 150. 
VUeliability, index of, 1 55. 

Reliability of coefficients of correlation, 
. 252. 

^Reliability of difference, 242. 

Reliability of judges’ ratings, 153, 221. 
Reliability of scores, 150. 

Retaining figures, fl. 

Retest coefficient, 150. 
p (rho), Symbol for Coefficient of rank 
correlation based on squares of differ- 
ences. 

Rolling averages, method of, 38. 

Root, error of a, 10. 

Root-mean-square deviation, 88. 
‘‘Rounding off” data, 7. 

iS, Symbol for Partial sum. 

Symbol for summation of squares from 
assumed mean. 

Saffir, Milton, 192. 

Sampling, 227. 

Sampling by stratification, 229. 

Sampling by subdivision, 229. 

Sampling error, 228. 

Scale of graphs, 33. 

Scatter, 83. 

Score, error of a, 155. 

SD, Symbol for Standard deviation. 


Second order coefficient of correlation, 
168. 

Second quartile, 71. 

Selection of sample, 228. 

Self-correlation, coefficient of, 150. 
Semi-interquartile range, 84. 

Semi-partial correlation, 170. 

Sheppard’s Table, 5. 

Short method of computing mean, 56. 
Shrinkage of coefficients of correlation, 
172. 

S (sigma), Symbol for summation. 
c (sigma). Symbol for Standard deviation; 

also for Standard error. 

<r2. Symbol for Variance. 

0 ‘est, Symbol for Standard error of estimate. 
ameas, Symbol for Standard error of 
measurement. 

Symbol for Standard error of esti- 
mate. 

0 - 1 . CO, Symbol for Standard error of meas- 
urement. 

c (sigma) score, 141, 201, 222. 

Significance ratio, 243. 

Significant digit, 8. 

Significant figure, 8. 

Sk, Symbol for Skewness. 

Skew curve, 50. 

Skewed curve, 50. 

Skewness, 50, 101. 

Slide rules, 5. 

Small sample theory, 231, 246. 

Smoothed curve, 38. 

Smooth frequency curve, 37. 

Smoothing, 38. 

Sorting machines, 6. 

Spearman-Brown formula, 151. 
Split-halves method, 152. 

Spread, 83. 

Spurious correlation, 126. 

“Stair-st^pp” curve, 41. 

Standard deviation, 47, 88, 94. 

Standard error, 143, 155, 231. 

Standard score, 141, 201, 222. 

Statistic, 227. 

Statistical inference, 227. 

Statistical procedures, 2. 

Step, 21. 

Straight line, 51. 

Stratification, sampling by, 229. 

Stub, 114. 

Subdivision, sampling by, 229. 

Sum, error of a, 8. 

Symmetrical curve, 49. 

Symmetrical distribution within a class, 
24. 

Systematic error, 9. 

7*, Symbol for Total. 

Symbol for a statistic divided by its 
standard error. 
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Table of double entry, 114. 

Tables for use in computing, 4. 
Tabulating machines, 6. 

10-90 percentile range, 85, 94, 

Ter, Symbol for Tertile. 

Terman, L. M., 219. 

Tertile, 71. 

Test for goodness of fit, 182, 210. 

Test scoring machines, 6. 

Tetrachoric correlation, 192. 
Theoretical mode, 64. 

Third quartile, 71. 

Thorndike, E. L., 219. 

Thurstone, L. L., 192. 

Tiegs, Ernest W., 13. 

Time units, 67. 

Total range, 83, 94. 

Transmutation of measures, 222. 

True mean, 56. 

True measure, 92. 

True mode, 64. 

True score, 147. 

True standard deviation, 92. 

T-scale, 219. 

T-score, 219. 

Two-fold classification, 16. 

u. Symbol for Upper limit. 

U-curve, 50. 

ul. Symbol for Upper limit. 

Unbiased error, 9. 

Unequal intervals, 23. 

Uni-modal distribution, 65. 

Universe, 227. 

Unordered classification, 16. 

Upper limit, 20. 

Unper quartile, 71. 

Uses of normal curve, 199. 

Uses of statistics, 1. 

V, Symbol for Coefficient of variability. 


Value of first case in a class, 25. 

Values of grouped cases, 24. 

Van Voorhis, Walter R., 14, 15, 54, 70, 
103, 136, 149, 159, 166, 178, 198, 211, 
226, 256. 

Variability, 82. 

Variability, coefficient of, 100. 

Variable, 16, 141. 

Variable error, 9. 

Variance, 93. 

Variate, Same aa Variable. 

Variation, 83. 

Verbal classification, 16. 

Votaw, David F., 13. 

W, Symbol for Weight. 

Walker, Helen M., 12, 13, 15, 31, 54, 70, 
81, 103, 136, 149, 166, 226, 256. 
Weighting independent variables, 173. 
Width of class, 20. 

WUks, S. S., 14. 

Woo, T. L., 182. 

Work units, 67. 

X, Symbol for a Variable. 

X, Symbol for Deviation of X; also for 
Abscissa. 

JT-axis, 33. 

Y, Symbol for a Variable. 

y. Symbol for Deviation of Y; also for 
Ordinate. 

2 / 0 , Symbol for Maximum ordinate. 
Yates, F., 229. 

F-axis, 33. 

Z, Symbol for Mode. 

2 , Symbol for Height of normal curve 
when Maximum ordinate is .3989. 
2 -score, 141, 201, 222. 

Zero-order coefficient of correlation, 168. 
f (zeta). Symbol for — r*. 





